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use In schizosaccharomyces pombe, expression 



(57) The present invention relates to an inducible 
promoter gene for use in the fission yeast Schizosac- 
charomyces pombe. an inducible expression vector 
containing the inducible promoter gene, Schizosaccha- 
romyces pombe transformed with the inducible expres- 
sion vector and a process for producing a heterologous 
protein by using the Schizosaccharomyces pombe 
transformant. 

The present invention also relates to a secretion 
signal gene for use in Schizosaccharomyces pombe . an 
expression vector containing the secretion signal, 
Schizosaccharomyces pombe transformed with the ex- 
pression vector and a process for producing a heterol- 
ogous protein by using the Schizosaccharomyces 
Porri^e transformant. These inducible promoter gene 
and secretion signal gene are originated from the S. 
PQ'^be gene Involved In expression of the invertase pre^ 
cursor 

The inducible promoter gene is originated from the 
^ region responsible for the catabolite repression of the 
<^ 
O 



CM 

CL 
LU 



invertase precursor. When a Schizosaccharomyces 
gombe transfomiant canying the inducible expression 
vector is incubated, the timing of the production of the 
desired heterologous protein can be controlled by reg- 
ulating the expression of the heterologous protein struc- 
tural gene depending on the presence or absence of a 
specific nutrient (especially glucose). 

The secretion signal gene encodes the secretion 
signal for the invertase precursor When a Schizosac- 
charomyce s pombe transfonmant carrying an expres- 
sion vector containing the secretion signal gene Is incu- 
bated, a heterologous protein fused with the secretion 
signal Is synthesized as a precursory protein in the cell, 
and after intracellular separation of the secretion signal^ 
the heterologous protein is secreted from the cell! 
Therefore, it is possible to obtain the desired protein 
from the cell culture, and the desired heterologous pro- 
tein can be purified easily. 
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Description 

r0001 1 The present invention relates to an inducible promoter gene and secretion signal gene for use in the fission 
yeast Schi7osaccharomvcespombe (hereinafter referredto as S. pombe). an expression '^^"If '"'"9 .^T ?riS 
a process for producing a protein using them. In particular, it relates to a process for producing a desired protein wherein 
the S pombe invertase promoter is used to make it possible to control the timing of the protein production by the 
presence or absence of a specific nutrient through regulated gene expression, and a process for secretory production 
of a desired protein by using the secretion signal gene for the S. pombe invertase precursor „„,^,.,ar 
r00021 S pombe. despite being a eukaiyote. has been studied extensively for its high versatility in genetics, molecular 
biology and cellular biology as a unicellular organism (Nasim A et al. eds.. Molecular biology of the fission yeast 
Academic Press, 1989). In its cultures, monosaccharides such as glucose and fructose are used as the mam carbon 
sources It is known that in a culture medium lacking these monosaccharides, expression of invertase the enzyme 
that degrades sucrose into glucose and fructose, is induced to secure the carbon source necessary for its growth 
(Moreno S. et al.. Arch Microbial. 142, 370, 1985). 

00031 S pombe invertase is located on the cell surface and is a high-molecular weight glycoprotein with a molecular 
weight of about 205000. 67% of which is attributed to sugar chains composed of equimolar amounts of mannose and 
galactose residues. Molecular weight and amino acid studies of the protein moiety of the pure enzyme and expenments 
using antibodies have shown high similarlity between S. pombe invertase and the invertase fr^^^^''"'!?'"^ ^^f^ 
SHCchar omvces cerevisiae from the viewpoint of protein chemistry (Moreno S. et al., Biochem. J. 267, 697. ' 
is also known that a drop in glucose concentration de-represses synthesis of invertase (Mitchinson J. et al.. Cell Sci 

ro0041 ^ tndiced invertase synthesis (de-repression) is also observed in Saccharomyces cerevisiae. Previous detailed 
slud.cs on genetic regulation of invertase expression, the biosynthetic pathway and the structure of the sugar chain 
moic-v have shown that Saccharomyces cerevisiae invertase is encoded by six overlapping genes, SUC1 to SUC5 
ard SUC7. on one chromosome and that activation of at least one of these SUC genes leads to utilization of sucrose 
rtrdraffinose(HohmannS.etal.,Curr Genet 11, 217, 1986). ^^^^^/M^r^nn 
roOOSl In contrast, with respect to S. pombe , although purification of the invertase protein has been reported (Moreno 
S ri al 1^85) no invertase genes had been identified until the present inventors and coworkers recently reported 
Jwo overlapping invertase genes invO- and invl - in S. pombe . Because invO* is likely a pseudogene having an imcom- 
plclo open reading frame. inv1* is the only one gene encoding S. pombe invertase, which is supposed to confer the 
ability to grow on sucrose even in the absence of other carbon sources ("Kobogaku" edited by Yositaka Hashitani. 

[OOOer Analysis oHhe promoter region of the isolated gene suggested that a specific sequence between the 1 st and 
62nd base pairs is involved in catabolite repression. .^okm..^ fr«m Hiff^r 

rOOOTl in Saccharomyces cerevisiae the SUC2 gene is transcribed into two messenger RNAs (mRNAs) fo"^ differ- 
ent transcription initiation sites. The shorter one is a constitutive mRNA encoding the intracellular invertase, whi e the 
longer one is a mRNA encoding the catabolite-repressible secretory invertase with a de-repression ratUD of not less 
than 200 (Carlson M . et al. , Mol. Cell. Biol. 3, 439. 1 983). Analysis of the promoter region forthe longer ""RNAsuggested 
that the transcription initiation factor binds to a specific repeated sequence between positions -650 and -418 (Salok^ 
L ot al.. Mol. Cell. Biol. 6, 2314, 1986). The region between positions -418 and -140 has been shown to be necessary 

looOBWhS'regions in the SUC2 gene showed no significant homology with the invl- "Pst^««"Vf^|°" ^/**';f.®" 
positions 1 and 2809. However, multiple copies of a so-called 7-bp motif with the sequence (A/C) (A/G)GAAAT. which 
is repeated at five sites in the region indispensable for glucose derepression, have been found in the invU upstream 
region Further, while palindrome stem-loops have been identified at almost the same positions in the upstream regions 
of qlucose-repressible genes (SUC. MAL and GAL), palindrome sequences have also been found in the upstream 
region of the invl - genefrom S. pombe . These sequences are anticipated to play an important role in glucose repression 

rO OOsT^The yeast S. pombe is phylogenetically different from Saccharomyces cerevisiae. It is quite different from 
other yeasts in the chromosome structure and various mechanisms for genome replication, RNA splicing transcnption 
and posttranslational modification, and rather resembles animal cells in some of these aspects. For this reason, 
ponnbc is widely used as a eukaryotic model (Giga-Hama and Kumagai, eds., Foreign gene expression in fission yeast 

Schizosacch aromyces pombe . Springer-Veriag, 1997). 

rOOIOl S pombe is also wide ly used as a host for expression of heterologous protein genes and known to be suited 
especiall y for expre ssion of genes from animals including human (JP-A-5-1 5380 and JP-A-7-1 63373). For its advanced 
membrane stmctures including the Goldi body and the endoplasmic reticulum, S. pombe is also ^''P^^^^^" 
of membrane proteins and showshighlevelexpression.ForSjeomte.constitutiveexpressionvectors^^^^^^^^ 

and pTL2M) and an inducible expression vector using the promoter region of the nmt1+ gene (pREPI) are usually 
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used as expression vectors. No S. pombe expression vectors of the GAL type or the SUC type have been known 
though these types of vectors are commonly used for Saccharomyces cerevlsrae . 

[0011] The expression of the SUC2 gene from Saccharomyces cerevlsiae in S. pombe has been shown to be con- 
stitutive, not catabolite repressible, though the expression product contains galactose residues conferred by the host 

5 (Zarate, V. et al., J Applied Bacteriology 80, 45, 1996), suggesting differences between S. pombe and Saccharomyces 
cerevlsiae in the mechanism for catabolite repression of Invertase. The differences are of great significance because 
the promoter from Saccharomyces cerevlsiae usually used by those skilled in the art for constmction of inducible 
expression vectors of the invertase type (the SUC2 type) is not applicable to S. pombe vectors. Therefore, development 
of S. pombe vectors of this type has been long delayed. 

10 [0012] On the other hand, the present inventors constructed an expression vector using the secretion signal gene 
encoding the secretion signal in the precursor of a S. pombe mating pheromone (WO96/23890). However, this secretion 
signal gene is not an all-purpose secretion signal gene, and other secretion signal genes that function in S. pombe are 
desired for production of some types of protein. 

[0013] As a result of their extensive research with a view to solving the above problems, the present inventors have 
accomplished the present invention by preparing a new clone of the S. pombe invertase gene and constructing an 
inducible expression vector. They have also found that the N-temninal 22 amino acid sequence in the amino acid of 
the invertase precursor functions as a secretion signal. On the basis of these findings, they have constructed an ex- 
pression vector using the secretion signal gene and established secretory production of desired proteins. 
[0014] The present invention relates to a region in the invertase gene from S. pombe , which is involved In catabolite 
repression, an inducible expression vector using the region and a system using it for heterologous gene expression 
and provides: 
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a DNA in an invertase gene from Schizosaccharomyces pombe , which is located in a region involved in catabolite 
repression, 

25 a DNA having the base sequence of bases 1 to 2809 in SEQ ID NO; 1 in the Sequence Listing, 

a recombinant vector containing the sequence of the DNA, a multicloning vector containing the sequence of the 

DNA and a multicloning site, 

a multicloning vector having the structure shown in Figure 9, 

an expression vector for transformation of Schizosaccharomyces pombe containing the sequence of the DNA and 
50 a heterologous protein structural gene, 

a transformant from Schizosaccharomyces pombe containing the expression vector, and 

a process for producing a protein which comprises incubating the transfomiant and recovering an expressed het- 
erologous protein. 

35 [0015] Firstly, the present inventors cloned and sequenced a S. pombe invertase gene, which had not been genet- 
ically identified. Then, they demonstrated by gene disruption analysis that the invertase gene is responsible for the 
overall invertase activity. Further, they identified the region involved in catabolite repression and constructed an induc- 
ible expression vector using the region. They actually constructed a recombinant vector carrying the gene of a green 
fluorescent protein, transfonned S. pombe with the vector and conflnned the expression of the protein by assay of 
invertase activity and immunological analysis. They also demonstrated repression of the heterologous gene expression 
in the presence of glucose in the culture medium and derepression by exhaustion of glucose. 
[0016] The present inventors used the following procedure to Identify and characterize the gene of the S. pombe 
invertase precursor: 
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(1 ) PGR using a cDNA library from S. pombe as a template and primers based on conserved amino acid sequences 
in Invertase genes from many other organisms; 

(2) screening of a genomic library from S. pombe by plaque hybridization using the PGR product as a probe for 
positive clones; 

(3) confirmation of the positive clones by restriction digestion followed by electrophoresis; 

(4) Southern hybridization analysis and total sequencing of a fragment with a specific length in the positive clones; 

(5) gene disruption analysis of invertase activity; 

(6) investigation of the optimum pH for expression of the Invertase gene from S. pombe and the effects of the 
glucose concentration on glucose repression and derepression; and 

(7) identification of a region indispensable for glucose repression through subcloning of the related upstream re- 
gion. 

[0017] Also, the present inventors constructed a 8. pombe invertase Inducible expression vector by the following 
procedure and actually demonstrated inducible expression of a green fluorescent protein: 
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(1 ) construction of an inducible multidoning expression vector pRIOM containing an invertase promoter by modi- 
fying a S^pombe multicloning vector, pTL2M (JP-A-7-1 63373); 

(2) construction of an inducible expression vector pRIOEGFP for expression of a green fluorescent protein variant 
from the inducible multidoning vector, pRIOM; 

5 (3) transformation of a wild-type S. pombe strain with the inducible expression vector. pRIOEGFP. for expression 

of the green fluorescent protein variant; . 

(4) demonstration of the expression of the green fluorescent protein variant by activity (fluorescence) analysis and 
SDS-PAGE-western blotting; and 

(5) establishment of the conditions for the Inducible expression on the basis of the dependence of the expression 
10 level on the glucose concentration in the culture medium. 

[00181 SEQ ID NO: 1 in the Sequence Listing is the base sequence of the gene of the invertase precursor, which 
contains a region involved in catabolite repression. The region involved in catabolite repression is the DNA sequence 
between positions 1 to 2809 or within the DNA sequence. In the DNA sequence between positions 1 and 2809, the 
15 region extending from position 1 to position 620 and the region extending from position 1610 to position 2610 are 
especially important, as is evident from the results of the analysis in Example 6 shown in Figure 8 (position 2810 in 
SEQ ID NO- 1 corresponds to position 1 In Figure 8). This means that the inducible promoter in the present invention 
is not restricted to a DNA having the base sequence from position 1 to position 2809 so long as it contains these genes 
involved in catabolite repression and functions as an inducible promoter. Still, a DNA having a base sequence from 
20 position 1 to position 2809 is preferred as an inducible promoter because it actually functions in S. pombe. 

[00191 The above-mentioned DNA which contains genes involved in catabolite repression and function as an induc- 
ible promoter, preferably having the base sequence from position 1 to position 2809, is hereinafter referred to as the 
inducible promoter gene. The inducible promoter gene can be integrated with a vector for construction of recombinant 
vectors such as multidoning vectors and expression vectors. A multidoning vector is a vector having a multidoning 
25 site and provides an expression vector through introduction of a desired structural gene into the multidoning site. An 
expression vector is a vector containing a structural gene and used for expression of a strudural gene encoding a 
heterologous protein. A "heterologous" protein is a protein which is not inherent In the host. For example, when the 
host is S pombe , a heterologous protein is a protein which is not inherent in S. pombe (such as a human protein). 
[0020] In the expression vedor. the inducible promoter gene Is located upstream from the heterologous protein struc- 
30 tural gene and regulates expression of the structural gene. The inducible promoter gene in the expression vector 
regulates the expression of the heterologous protein stmctural gene downstream, like the inducible promoter gene, 
located upstream In the base sequence represented by SEQ ID NO: 1 regulates the expression of the strudural gene 
of the invertase precursor In the multidoning vector, the indudble promoter gene is located upstream from the multi- 
doning site into which a heterologous protein strudural gene is to be introduced. » 
[0021 1 O ne example of the multidoning vector of the present Invention is the multiclonig vector pRIOM constructed 
in Example 7 and has the strudure shown in Figure 9. The entire base sequence of pRIOM is SEQ ID NO: 2. Inv1-P 
is the above-mentioned inducible promoter gene, and MCS is the multidoning site. One example of the expression 
vedor d the present Invention is the inducible expression vector pRIOEGFP for expression d a green fluorescent 
protein variant constmded in Example 8 by Introdudng the strudural gene (EGFP-ORF) of a green fluorescent protein 
variant and has the strudure shown in Figure 10. The entire base sequence of the expression vedor pRIOEGFP is 

PW^I The most suitable cell (host) to transform with the expression vector of the present invention is S. pombe 
because the indudble promoter gene in the present invention is an inducible promoter gene from S. pombe . 
[0023] Under catabolite repressing conditions (for example, in a culture medium containing a high level of glucose), 
S pombe transfomied with the expression vector of the present Invention grows with no (or low) expression d the 
heterologous protein. Growth at this stage without the burden of heterologous protein expression Is more effidentthan 
growrth under the burden. Subsequent incubation under catabolite derepressing conditions (for example, m a culture 
medium containing no or a low level d glucose) invites the increased number of S. pombe cells to high level expression 
of the heterologous protein, though growth d S. pombe is less efficient than under catabolite repressing conditions. 
so Thus, controlled transition between growth of S. pombe and heterologous protein expression through catabolite re- 
pression allows more efficient production of a heterologous protein. 

[0024] Catabolite repression can be controlled not only in an active way as described above but also in a passive 
way For example when a S. pombe transformant is incubated in a culture medium containing a given amount of 
glucose the S pombe grows under catabolite repressing conditions containing a high level d glucose in the initial 
55 stage but later on production of a heterologous protein predominates due to catabolite derepression as glucose is 
exhausted. This way, more efficient heterologous protein production d than ever is possible without adive control d 

the glucose level. . 

[0025] In addition to the above-mentioned total sequencing of the invertase precursor gene from S. pomti e. ine 
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present inventors determined the complete amino acid sequence of the invertase precursor (the amino acid sequence 
in SEQ ID NO: 1 ). Then, they have found that the first 22 amino acid peptide in the amino acid sequence of the invertase 
precursor (Met Phe Leu Lys Tyr lie Leu Ala Ser Gly He Cys Leu Val Ser Leu Leu Ser Ser Thr Asn Ala) acts as a secretion 
signal. Hereinafter, the peptide is referred to as the secretion signal. 
5 [0026] It is expected that a desired heterologous protein produced in transformed S. pombe cells as a protein fusion 
having the secretion signal at the N-termlnal is secreted from the cells after intracellular processing which splits the 
protein fusion into the secretion signal and the heterologous protein. The present Inventors constructed an expression 
vector carrying a heterologous protein structural gene (specifically, human interleukin 6-a'c1 variant) fused with a DNA 
encoding the secretion signal (namely, a structural gene of a protein fusion as mentioned above) and demonstrated 

10 secretion of the heterologous protein from S. pombe cells transformed with the expression vector. 

[0027] The present invention provides the secretion signal, a DNA encoding the secretion signal (hereinafter refen-ed 
to as a secretion signal gene), a recombinant vector carrying the secretion signal gene, a multicloning vector carrying 
the secretion signal gene, an expression vector carrying the secretion signal gene and a heterologous protein structural 
gene for transfomriation of S. pombe , a S. pombe transfonnant carrying the expression vector and a process for pro- 

15 ducing a protein which comprises incubating the transformant and recovering the expressed heterologous protein. 
[O028] The secretion signal gene Is not restricted to the 66-bp sequence extending from position 2810 to position 
2875 in SEQ ID NO: 1 and may be a DNA having a different base sequence encoding the amino acid sequence of the 
secretion signal. In the expression vector, the secretion signal and the heterologous protein structural gene Is preferably 
linked directly. But they may be linked via another DNA sequence, for example, extending from position 2876 in SEQ 

20 ID NO: 1 . In this case, the protein product has extra amino acid residues at the N-terminal of the heterologous protein 
but can be converted into the desired heterologous protein by trimming off the N-temninal extra amino acid residues. 
However, the disadvantage from the presence of these extra amino acid residues usually becomes more serious for 
the desired protein as the number of extra amino acid residues increases. Therefore, as the intervening DNA between 
the secretion signal gene and the heterologous protein structural gene, a short DNA encoding at most 10 amino acid 

25 residues is preferable. Particular preferably, the secretion signal gene and the heterologous protein structural gene 
are linked directly. 

[O029] Construction of an expression vector carrying the secretion signal gene using a multicloning vector can be 
attained by inserting a heterologous protein structural gene fused with the secretion signal gene into the multicloning 
site of a multicloning vector or by inserting a heterologous protein structural gene into the multteloning site of a multl- 
30 cloning vector carrying the secretion signal gene. The latter method tends to restrict the structure of the multicloning 
site because the secretion signal gene is preferred to be located immediately in front of the multicloning site as described 
above. Therefore, the former method is preferred for construction of an expression vector. As a 8. pombe multk:loning 
vector, for example, pTl-2M, which is disclosed In JP-A-1 63373, is preferable. 

[0030] According to the present invention, an expression vector can be constructed by using both the inducible 
35 promoter gene and the secretion signal gene. For example, an expression vector which contains the DNA sequence 
of from position 1 to position 2875 in SEQ ID No: 1 and a heterologous protein structural gene Introduced downstream 
of the DNA sequence can be constructed. Such an expression vector enables catabolite repressible secretory produc- 
tion of a heterologous protein by the host cell. A similar expression vector can be constructed by using a known secretion 
signal gene (such as the secretion signal gene disclosed in WO96/23890) Instead of the above-mentioned secretion 
40 signal gene. 

[O031] Figure 1 shows a comparison of (partial) amino acid sequences deduced from inv1+, the Schwann lomyces 
occidentalis invertase gene and the fission yeast SUC2 gene. Figure 2 is the restriction map of the invl + gene. Figure 
3 electrophoretically shows disruption of the invl + gene. Figure 4(a) is a photograph of colony gel overlay assay of 
Invertase activity (for phenotype characterization) as a substitute for a drawing. Figure 4(b) is a schematic explanation 
45 of experimental design of the Invertase activity assay shown in Figure 4(a). Figure 5(a) is a photograph of colony gel 
overlay assay of Invertase activity (for phenotype characterization) as a substitute for a drawing. Figure 5(b) Is a sche- 
matic explanation of experimental design of the invertase activity assay shown in Figure 5(a). 

[0032] Figure 6 graphically shows the relation between the invertase activity and the glucose concentration. Figure 
7 graphically shows the relation between the invertase activity and the glucose concentration. Figure 8 shows the 

so results of analysis of invertase promoters. Figure 9 show the structure of an inducible expression vector pRIOM. Figure 
10 shows the structure of an inducible expression vector pRIOEGFP for expression of a green fluorescent protein. 
[0033] Figures 11(a) and 11(b) demonstrate the expression of a green fluorescent protein. Figures 12(a) and 12(b) 
graphically show the relation between the incubation time and the expression level of the green fluorescent protein. 
Figures 13(a) and 13(b) graphically show the relation between the incubation time and the expression level of the 

55 green fluorescent protein. Figures 14(a) and 14(b) show the relation between the incubation time and the expression 
level of the green fluorescent protein. Figure 15 is a SDS-PAGE pattern obtained in analysis of the expression of 
interleukin 6a'c1 variant. Figure 16 is the western blot pattem of the expressed interleukin variant. 
[0034] Now, the present invention will be described in further detail with reference to specific Examples. 
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EXAMPLE 1 

Isolation of S. pombe invertase gene 

5 [0035] PGR using a S. pombe cDNA library as the template and primers designed on the basis of conserved se- 
quences in invertase genes from other organisms shown in SEQ ID NOS: 3 to 5, gave amplification products of about 
SOObp and about 400bp. Each PGR product was purified by using EASY TRAP (TakaraShuzo Go., Ltd.) and sequenced 
after ligation into a vector by using pMOS Blue T vector kit (Amersham Pharmacia Biotech K.K.). The deduced amino 
acid sequence indicated that part of the 400-bp PGR product has significant homology with the SUG2 gene from 

10 Saccharomyces cerevisiae . 

[0036] Screening of a genomic library of S. pombe for the entire invertase gene by plaque hybridization using the 
400-bp PGR product as a probe picked up 15 positive clones from about 8,000 plaques. Secondary screening of the 
positive clones by plaque hybridization left four positive clones. Treatment of small amounts of phage DNA extracts 
from the positive clones with various restriction enzymes gave identical cleavage pattems, and thus revealed that all 

15 the clones were identical. 

[0037] The entire invertase gene was isolated by the following two-step procedure. A 3.0-kb Hindlll fragment was 
isolated from the hybrid-forming clones and ligated into a plasmid pBluescript II SK- (Toyobo Go., Ltd.). Restriction 
mapping identified BamHt and Sail sites. Subcloning of the fragment using these restriction enzymes and subsequent 
sequencing using a deletion technique revealed that the Hindlll fragment contains the complete ORF of the gene but 

20 contains only about 200 bp within the upstream region, which is supposedly involved in the gene expression. Therefore, 
separately, a 3.5-kb BamHI fragment from the hybrid-forming clones was further digested with Hindlll to give a 2.6-kb 
fragment. Subcloning of the 2.6-kb fragment in plasmid pBiuescript 11 SK-, and subsequent sequencing using a deletion 
technique revealed that the BamHi-Hlndllt fragment contained a sequence within the upstream region supposedly 
involved in the gene expression. The resulting complete 5.6-kb gene was designated as inv1+. The base sequence of 

25 inv1 and the amino acid sequence encoded by its ORF are shown in SEQ ID NO: 1 . A plasmid carrying the complete 
gene was designated as plNV3000. 

[0038] These results suggest that the inv1+ product has 16 asparagine-l inked glycosylation sites. Figure 1 shows a 
comparison of the amino acid sequence deduced from the base sequence of the inv1+ gene from S. pombe , the amino 
acid sequence of Schwann iomyces occidentalis invertase and the amino acid sequence deduced from the base se- 
30 quence of the SUC2 gene from Saccharomyces cerevisiae . Amino acids that are common to the three are marked 
with *. Figure 1 clearly shows the amino acid sequence deduced from the invl base sequence has significant homology 
with invertases from other origins such as Schwann iomyces occidentalis and Saccharomyces cerevisiae , which sug- 
gests the invl^ encodes invertase. 

35 EXAMPLE 2 

Disruption of the invl-*- gene 

[0039] The Hindlll site in plasmid pBluescript II SK- having a S. pombe ura4+ gene insert at the Glal site was disrupted 

40 by Hindlll digestion followed by blunting and self-ligation (self-cyclizaion). Double restriction digestion of the plasmid 
with Xbal and Hindi gave a ura4+ fragment. The plasmid pBluescript was integrated with the ura4+ fragment after 
restriction digestion with Spet and subsequent blunting and Xbal digestion, to provide a plasmid having BamHI sites 
on both sides of ura4'^. The BamHt site in plasmid pBluescript II SK- was disrupted similarly by restriction digestion 
followed by blunting and self-ligation, and the 3.0-kb fragment containing the inv1+ ORF was inserted at the Hindlll 

45 site. BamHI digestion of the plasmide eliminated a 1 .4-kb fragment (containing part of the inv1+ ORF encoding the C- 
tenninal of invertase) from the 3.0-kb insert, and a ura4-^ cassette having BamHI sites at both ends was Inserted. Hindlll 
digestion of the resulting plasmid gave a DNA fragment having invl-*- neighboring regions at both ends (Figure 2). The 
restriction map of the invl + gene is shown in Figure 2, wherein the open reading frame (ORF) Is indicated by the arrow 
(invl+ORF) and the ura4 replacement from Schizosaccharomyces pombe is boxed (ura4+). The disruption mutant 

50 strain had an inv1 + fragment carrying the S. pombe ura4"*- gene instead of the 1 .4-kb invl^ BamHI-BamHI fragment 
partly containing the ORF. The invl-^ fragment was used to transfonn a wild-type S. pombe strain, TP4-1D [h", leul , 
ura4, ade6-M216, his2, obtained from Dr. Takashi Toda (Imperial Gancer Research Foundation)], and viable colonies 
on a uracil-free culture medium were collected. Overiay assay of invertase activity revealed that 7 out of 28 strains, 
namely 25% of the ura4+ colonies, lacked invertase activity. 

55 [0040] Further, to verify the chromosomal invl + gene disruption, genomic DNA from a strain lacking invertase activity 
was analyzed after double restriction digestion with Hindlll and Sail by Southern hybridization using the inv1+ Hindl- 
It-Sall fragment (2 kb) as the probe. The 3-kb hybridized fragment, which was not digested with Sail , shown in Figure 
3 demonstrates that part of the inv1+ gene in the chromosomal DNA had been replaced with the ura4+ gene in the 
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strain which lacked invertase activity. 

[0041] Thus, the mvl^-gene proved to be the only one invertase gene expressed In S: pombe - - 
EXAMPLE 3 

5 

Restoration of invertase activity by the inv1+ gene 

[0042] The 3.0-kb Hindlll fragment containing the entire inv1+ ORF from the Invertase gene was inserted into S. 
pombe vector pAU-SK (obtained from Dr Chikashi Shimoda, Department of Science, Osaka City University), and the 

10 resulting recombinant vector was used to transform the invertase-defective strains (Example 2). The resulting trans- 
fonnants were streaked on YP sucrose plates (supplemented with 1 0 ^ig/mf antimycin A and 20 \v^me bromocresoi 
purple) for overlay assay of invertase activity. Further, the 2.6-kb inv1 + BamHI-Hlndill fragment containing the upstream 
promoter region and the 2.0-kb Hindlll-Sall fragment from the invertase gene containing the ORF were legated, and 
the resulting 4.6-kb BamHf-Sall fragment was Inserted into pAU-SK. Transfomnation of the resulting recombinant vector 

15 into the invertase-defective strain was followed by similar overlay assay of invertase activity. 

[0043] Both transformants (Inv1 A[pAU-SK::inv1 +]) and the wild-type strain TP4-1 D (WT) developed blue stains, which 
Indicate invertase production, unlike the Invl + disruption mutant strain (invl A). The addition of the upstream promoter 
region resulted In stronger stains, which suggest high-level invertase expression (Figure 4(a) and 4(b)). Figure 4(a) is 
a photograph showing the results of the gel overlay assays, and Figure 4(b) is a schematic explanation of the stained 

20 sections shown in Figure 4(a). 

[0044] The invertase-defective strain were hardly viable on YP sucrose plates (supplemented with 1 0 jig/m€ antimy- 
cin) whereas the wild-type strain and the transformants were recognizably viable after 5 days incubation at 30*'C (Figure 
5(a) and (b)). Figure 5(a) is a photograph showing the results of characterization by colony formation, and Figure 5(b) 
schematically explains the characterization shown in Figure 5(a). 

25 [0045] These results demonstrate that the inv1 + gene expression product is the invertase located on the cell surface. 

EXAMPLE 4 

Determination of glucose concentration for gene repression 

30 

[0046] For determination of the critical glucose concentration for catabolite repression of the invertase gene, the 
wild-type strain TP4-1 D was incubated at 30°C in 5 m^ of MM medium containing 2%, 4%, 8% and 16% glucose with 
shaking to the mid-logarithmic growth phase. Invertase assays were done by the method of Goldstein et al., and the 
post-incubational glucose concentrations in the medium were determined by the phenol -sulfate method (Figure 6). 
35 The hatched bars indicate invertase activity per cell (U/OD), and the empty bars indicate the residual glucose concen- 
tration. Judging from the graph, a glucose concentration of 8% is the optimum for glucose repressing incubation, be- 
cause when the glucose concentration was 8%, the invertase activity was sufficiently repressed with little decrease of 
glucose. 

40 EXAMPLE 5 

Determination of glucose concentration for induced invertase production 

[0047] For detemninatlon of the most effective glucose concentration for induced invertase production, the wild-type 
45 strain TP4-1 D and a transfonnant [obtained by transfomnlng the invertase-defective strain (Example 3) with a pAU-SK 
vector carrying the lnv1+ BamHI-Sall fragment] were prelncubated in a medium containing 2% glucose to the mid- 
logarithmic growth phase and incubated In an MM medium containing 0%, 0.01%, 0.05%, 0.1%, 0.25%, 0.5%, 1.0% 
and 2% glucose with shaking at 27**C for 3 hours, and the invertase activity was assayed (Figure 7). Each run of assay 
was carried out at 30^0 over 180 minutes. 1U of invertase converts 1 nmol of sucrose into glucose per 1 minute at 
50 30°C, pH 4.0. 

[0048] The optimum glucose concentration for Induction was found to be 0. 1 % for the wild-type strain and 0.05% for 
the transformant. The invertase activity in the wild-type strain was 40 times higher under derepressing conditions than 
under repressing conditions. These results demonstrate catabolite repression in S. pombe . 
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EXAMPLE 6 

Analysis of the invl* promoter region 

r00491 Fragments obtained ligating S. pombe inv1 ^ upstream sequences extending from positions 1 , 620, 11 00. 
5 61 0 ind 26?rrespectively. and tine in^;?^ were inserted Into expression vector pAU-SK to obta.n 5 pl^m-ds or 
deletion studies. The plasmlds containing the upstream sequences extending from positions 1 . 1610 2610 car^^ 
the Invl- BarpHI-Sall. Sacl-Sall and Hindlll-Sall fragments, respectively. The plasmids contam.ng the .nv1* upstream 
sequences emending from positions 620 and 1100 were constructed by site specific introduction of a Spe s,te mto 
rAu lK^invIM^^^^^ primers shown in SEQ ID NOS: 6 and 7. respectively, followed by partial removal 

of the uDstream reaion bv Spel treatment. _ . . _^ 

[0050] The plasmids thus obtained were used to transform the invertase-defective strain (Example 3). Invertase 
assays were doneto detemiine the enzyme activity in each transfomiant (Fig. 8). The results suggest ttiatt^ 
between position 1 and position 620 Is essential for glucose repression. The region between position 1620 and position 
>5 261 0 was identified as essential for high-level glucose derepression of invertase. 

EXAMPLE 7 

Construction of inducible multicloning expression ve c tor pRIOM carrying the invertase promoter 

r0051 1 PCR amplification using the plasmid plNV3000 (Example 1 ) carrying the invertase gene from S. pombe as 
he template and ollgo DNAs shown in SEQ ID NOS: 8 and 9 as the primers was perfomned to give a sequence wh«h 
contains the promoter region forthe invertase gene and has restriction enzyme --^^^^a" '-on sequences at both ends^ 
After terminal double restriction digestion with Spel (Takara Shuzo Co.. Ltd.) and EcoRI (Takara Shuzo Co Ltc^), the 
sequence was subjected to agarose gel electrophoresis. Purification of the band of about 3000 bp by the glass beads 
method using EASY-TRAP gave an insert fragment. , , . u 

[00521 The S. pombe multicloning vector pTL2M (JP-A.7-1 63373) was subjected to agarose gel electrophoresis after 
[em,inal doub-te^iw5ion digestion with Spel and Ecol. Purification of the band of about 4500 bp by the glass beads 
method using EASY-TRAP gave a vector fragment. ^ c ^„ii 

So531 The two fragments were ligated with a DNA ligation kit (Takara Shuzo Co.. Ltd. and transformed into E^c^^ 
strain DH (Toyobo Co., Ltd.). E. coli colonies were screened for the inducible expression vector pRIOlM shown n 
Figure 9 and SEQ ID NO: 2 in the Sequence Listing through base sequencing and restriction mapping, and the inducible 
expression vector was recovered by the alkali-SDS method on a preparatory scale. 

35 EXAMPLE 8 

Construction of inducible expression vector pRIOEQFP f or expression of green fluorescent protein 

[00541 PCR amplification using the plasmid plNV3000 (Example 1) carrying the invertase gene from S. pombe as 
he template and oligo DNAs shown in SEQ ID NOS: 8 and 1 0 as the primers was perfomied to give a sequence which 
contains the promoter region forthe invertase gene and has restriction enzyme recognition sequences at both ends_ 
Aner lenilnardouble resUlction digestion with Spe. and Nhel (Takara Shuzo Co., Ltd.). the -^^^f 
to agarose gel electrophoresis. Purification of the band of about 3000 bp by the glass beads method using EASY-TRAP 
aave a fraament for use as the promoter insert. • * ^ 

[00551 PCR using the plasmid pEGFP carrying the Jellyfish (Aequorea victria) green fluorescerit protein variant gene 
Clontech) as the template and ollgo DNAs shown In SEQ ID NOS: 11 and 12 In ^^^t ^1?.^''% Tlf to 
was performed to amplify the ORF in the green fluorescent protein variant gene. The PCR product was sub ected to 
agarose gel electrophoresis after tem^inal double restriction digestion with Nhel and Hindlll. Punfication of the band 
of about 700 bp by the glass beads method using EASY-TRAP gave a fragment for use as the ORF insert. 
[0056] The S pombe multicloning vector pTL2M was cleaved by double restriction digestion with Spel and HindlM 
and then subJeli^S^garose gel electrophoresis. Purification of the band of about 4500 bp by theglass beads method 

usinq EASY-TRAP gave a vector fragment. . 
[00571 The three fragments were ligated with a DNA ligation kit and transformed into E. col. stram DH5 E. col. 
colonies were screened for the inducible expression vector pRIOEGFP for expression of the green fluorescent prote.n 
variant shown in Figure 10 and SEQ ID NO: 13 in the Sequence Listing through base sequencing and restriction 
mapping and the inducible expression vector was recovered by the alkali-SDS method on a preparatory scale. 
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EXAMPLE 9 



Preparation of S. pombe transformant ASP138 

5 [0058] S. pombe wild-type strain ARC001 [Ieu1 -32h- (isogenic to ATCC38399)1 was transformed with the inducible 
expression vector pRIOEGFP for expression of the green fluorescent protein variant and a transducing vector pAL7 

as described by Okazaki et al. (Okazaki et al., "Nucleic Acids Res.", 18, 6485-6489, 1990). 

[0059] 1 of a preincubated ARC001 culture in YPD medium was incubated in minimum medium MB+Leu with 
shaking at SO'C for 14 hours to a cell density of 3x10^ per 1 m€, and the cells were collected, washed with water, 

10 suspended In 1 m^ of 0.1 M lithium acetate (pH 5.0) and incubated at SO^^C for 60 minutes. A 100 portion of the 
suspension was mixed with 4 p.g of the Inducible expression vector pRIOEGFP and 0.5 i^g of Pstl-digested pAL7 in 15 
\i€ TE and further with 290 \i€ of 50% PEG 4000 thoroughly and incubated at 30°C for 60 minutes, at 43**C for 15 
minutes and at room temperature for 10 minutes, successively. After centrifugal removal of PEG, the cells were sus- 
pended in 1 m€ of 1/2 YEL+Leu medium. After 10-fold dilution, 1 m€ of the suspension was incubated at 32*C for 2 

15 hours, and a 300 \ie portion was spread on minimum medium agar MMA. After 3 days of Incubation at 32 **C, about 
300 independent colonies had developed on the plate. 

[0060] 1 0 colonies of the transformant were inoculated in 2 mf of YEL medium containing 1 0 [ig/m€ antibiotic G41 8 
(YEL1 0 medium) and incubated with shaking at 32''C. 2 days later, 6 clones were viable. In their subcultures, 4 clones 
were viable 3 days later. The putative desired transformant (ASP138 strain) was frozen in glycerol and stored for use 
20 in subsequent experiments. 

EXAMPLE 10 

Analysis of expression of green fluorescent protein variant 

25 

[0061] S. pombe transfomnant ASP138 (Example 9) was inoculated in YPD medium containing 100 ^.g/m€ G418 
(YPD100) and Incubated at 32°C for 2 days. Green fluorescence was observed from each cell under a fluorescence 
microscope (excitation wavelength 490nm/emission wavelength 530nm). Green fluorescence emission was also ob- 
served upon ultraviolet irradiation from the centrifugally collected cells. Thus, expression of the desired green fluores- 
ce cent protein in the active form was confirmed. 

[0062] Strain ASP138 (Example 9) was incubated in 5 m€ YPD100 at 32°C for 3 days, collected, washed and sus- 
pended in 50mM tris-HC€ (pH 7.6), disrupted with glass beads in a mini bead beater (Biospec). After removal of the 
glass beads, the cell extract was heated in the presence of SDS (1%) at 80°C for 15 minutes. Separately, a negative 
control was extracted from the transformant carrying pRIOM by the same procedure. 
35 [0063] 50 \ig protein from the extract was analyzed by SDS-polyacrylamide gel electrophoresis (Figure 11(a) and 
(b)). After Coomassie Brilliant Blue (CCB) staining, the extract and a recombinant green fluorescent protein (Clonetech) 
as the positive control showed major bands with a molecular weight of 25000, but the negative control did not. Further, 
50 ^g protein from the extract was analyzed by SOS-potyacrylamide gel electrophoresis followed by western blotting 
on a PVDF membrane using an anti-green fluorescent protein antibody (Clonetech). The cell extract and the positive 
40 control showed major bands with a molecular weight of 25000, but the negative control did not. These results provide 
biological evidence of expression of the desired green fluorescent protein. 

EXMAPLE11 

43 Optimization of the Incubation method (1) 

[0064] The transformed S. pombe strain ASP (Example 9) was incubated in YPD medium containing 100 \ig/m€ 
G418 (YPD100) at 32**C. The expression level of the green fluorescent protein in the cell culture was determined 
fluorometrically by means of a microplate reader (Corona Electric Co., Ltd.) equipped with a fluorescent attachment 

50 (excitation wavelength 490nm/ emission wavelength 530nm) (Figure 12(a) and (b)). OD, FLU/GD and time denote the 
cell density, the fluorescence Intensity per cell and the incubation time, respectively. The results show that strain 
ASP138 did not express the green fluorescent protein variant until the latc-growth phase after glucose exhaustion In 
the mid-growth phase, unlike strain AS PI 22 having a non-inducible cytomegalovirus promoter [a transformant carrying 
a recombination product of phGFPS65T (Clonetech) for expression of a green fluorescent protein variant of S65T 

55 prepared as disclosed in JP-A-7-1 63373], clearly due to repression of the Inducible invertase promoter in the presence 
of glucose and subsequent derepression by glucose exhaustion, demonstrating the applicability of this mechanism to 
expression of the green fluorescent protein as a heterologous protein. 
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EXAMPLE 12 

incubation of strain ASP138 (2) 

5 [0065] The transformed S. pombe strain ASP1 38 (Example 9) was incubated in YPD medium containing 1 00 jxg/mf 
G418 (glucose concentration 8%) at 32''C to the mid-growth phase, and after medium change. Incubated in YPDG 
medium containing 100 \ig/me G41 8 (glucose concentration 0.1%, glycerol concentration 3%) (Figure 13(a) and (b)). 
OD, FLU/OD and time denote the cell density, the fluorescence intensity per cell and the incubation time, respectively. 
The results show that while the green fluorescent protein was not expressed in the cells Incubated without medium 

10 change (Untreated), expression of the green fluorescent protein in the cells incubated with the medium change was 
activated by the medium change (Medium-changed) probably because the depletion of glucose in the medium provoked 
derepression of the invertase promoter and thereby induced the protein expression. The high level expression induced 
by the medium change to a low-glucose expression medium suggests that use of a growth medium (with a high glucose 
concentration) and an expression medium (with a low glucose concentration) can differentiate between cell growth 

IS and protein expression. It was demonstrated that the repression of the inducible invertase promoter in the presence 
of glucose and derepression by exhaustion of glucose could be utilized in expression of the green fluorescent protein 
as a heterologous protein. 

EXAMPLE 13 

20 

Incubation of strain ASP 138 (3) 

[0066] The transfomied S. pombe strain ASP1 38 (Example 9) was incubated in YPD medium containing 1 00 \x,g/m€ 
G418 (glucose concentration 8%) at 32**C to the late-growth phase, and after medium change, incubated in YPDG 

25 medium containing 100 ^g/mi G418 (glucose concentration 0.1%, glycerol concentration 3%) (Figure 14(a) and (b)). 
OD, FLU/OD and time denote the cell density, the fluorescence intensity per cell and the incubation time, respectively. 
The results show that while the green fluorescent protein was not expressed in the cells incubated without medium 
change (Untreated), expression of the green fluorescent protein in the cells incubated with the medium change was 
activated by the medium change (Medium-changed) probably because the depletion of glucose in the medium provoked 

30 derepression of the invertase promoter and thereby induced the protein expression. The high level expression induced 
by the medium change to a low-glucose expression medium from a high-glucose medium before glucose exhaustion 
suggests that use of a growth medium (with a high glucose concentration) and an expression medium (with a low 
glucose concentration) can differentiate between cell growth and protein expression. It was demonstrated that the 
repression of the inducible invertase promoter in the presence of glucose and derepression by exhaustion of glucose 

35 could be utilized in expression of the green fluorescent protein as a heterologous protein. 

EXAMPLE 14 

Construction of inducible lipocortin I expression vector pRIOLPI 

40 

[0067] PGR using plasmid pINVSOOO (Example 1) carrying the S. pombe invertase gene as the template and oligo 
DNAs shown in SEQ ID NOS: 1 4 and 15 in the Sequence Listing as primers gave an amplification product containing 
the promoter region in the invertase gene and having restriction enzyme recognition sequences at both ends. After 
terminal double restriction digestion with Spel and EcoRI, the amplification product was subjected to agarose gel elec- 
ts trophoresis. Purification of the band of about 3000 bp by the glass beads method using EASY-TRAP gave a fragment 
for use as a promoter insert. 

[0068] The expression vector pTL2L (JP-A-7-1 63373) carrying a human lipocortin I gene was subjected to agarose 
gel electrophoresis after tenninal double restriction digestion with EcoRI and Hindlll. Purification of the band of about 
1000 bp by the glass beads method using EASY-TRAP gave a fragment for use as the OPF insert. 
so [0069] The S. pombe multicloning vector pTL2M (JP-A-7-1 63373) was subjected to agarose gel electrophoresis after 
terminal double restriction digestion with Spel and Hindlll. Purification of the band of about 4500 bp by the glass beads 
method using EASY-TRAP gave a vector fragment. 

[0070] The three fragments were ligated with a DNA ligation kit and transformed into E, coli strain DH5. E. coli 
colonies were screened for the inducible lipocortin I expression vector pRIOLPI through base sequencing and restriction 
55 mapping, and the vector was recovered by the alkali-SDS method on a preparatory scale. 
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EXAMPLE 15 

Preparation of fission yeast Schizosaccharomyces pombe transformant ASP139 

5 [0071] A S. pombe wild-type strain ARC001 was transformed with the inducible lipocortin I expression vector pRIOLP I 
and a transducing vector pAL7 as described by Okazaki et al. 

[0072] 1 m^" of a preincubated ARC001 culture in YPD mediunn was incubated in minimum medium MB+Leu with 
shaking at 30°C for 16 hours to a cell density of 1x10^ per 1 m€, and the cells were collected, washed with water, 
suspended in 1 mf of 0.1 M lithium acetate (pH 5.0) and incubated at 30°C for 60 minutes. A 100 portion of the 

10 suspension was mixed with 2 iig of the recombinant vector pRIOLPI and 0.5 p.g of Pstl-digested pAL7 in 1 5 TE and 
further with 290 of 50% PEG 4000 thoroughly and incubated at 30*'C for 60 minutes, at 43''C for 15 minutes and at 
room temperature for 10 minutes, successively. After centrifugal removal of PEG, the cells were suspended in 1 m^ 
of 1/2 YEL+Leu medium. After 1 0-fold dilution, 1 m€ of the suspension was incubated at 32*0 for 2 hours, and a 300 
|if portion was spread on minimum medium agar MMA. After 3 days of incubation at 32 ^'C, about 300 independent 

15 colonies had developed on the plate. 

[0073] 1 0 colonies of the transformant were inoculated in 2 m^ of YEL medium containing 1 0 |xg/mf antibiotic G418 
(YEL1 0 medium) and incubated with shaking at 32*^0. 2 days later, 2 clones were viable. All the subcultures of them 
were viable 3 days later. The putative desired transformant (ASP138 strain) was frozen In glycerol and stored for use 
in subsequent experiments. 

20 

EXAMPLE 16 

Analysis of lipocortin I expression 

25 [0074] S. pombe transformant ASP 139 (Example 15) was incubated in YPD medium containing 100 \x,g/rx\€ G418 
(gtucoso concentration 8%) at 32''C to the stationary phase and collected as a non-inducible cell culture. Separately, 
ASP"^39 was incubated in the same medium at first to the mid-growth phase, then after medium change, incubated in 
^ YPDG medium containing 100 jig/mf G418 (glucose concentration 0.1%, glycerol concentration 3%) to the stationary 
phase and collected as an inducible cell culture. Both celt cultures were washed, suspended in 50mM tris-HC€ (pH 

30 7 5) anc disrupted with glass beads in a mini bead beater. After removal of the glass beads, the cell extracts were 
healed in the presence of SDS (1%) at 80**C for 15 minutes. 

[0075] 50 Mg protein from each extract was separated by SDS-potyacrylamide gel electrophoresis and stained with 
Coomassio BriHiant Blue. The extract from the inducible cell culture showed a major band of a molecular weight of 
about 45000. which is the same as the deduced molecular weigh of the recombinant lipocortin I protein, but the extract 

35 from the non-inducible cell culture did not. More sensitive western analysis of band density showed that the band from 
Ihc inducible cell culture extract was 10 times denser than the band from the non-inducible cell culture extract. The 
results show that while lipocortin I was not expressed in the cells incubated without medium change, the expression 
of lipocortin I in the cells incubated with the medium change was activated by the medium change probably because 
the depletion of glucose in the medium provoked derepression of the invertase promoter and thereby induced the 

40 protein expression. The high level expression induced by the medium change to a low-glucose expression medium 
from a high glucose medium suggests that use of a growth medium (with a high glucose concentration) and an ex- 
pression medium (with a low glucose concentration) can differentiate between cell growth and protein expression. It 
was demonstrated that the repression of the inducible invertase promoter in the presence of glucose and derepression 
by exhaustion of glucose could be utilized in expression of lipocortin I as a heterologous protein. 

. "^5 

EXAMPLE 17 

Construction of expression vector pTL2 IN VI carrying invertase gene 

50 [0076] PGR using plasmid pINVSOOO (Example 1) carrying the S. pombe invertase gene as the template and oligo 
DNAs shown in SEQ ID NOS: 16 and 17 in the Sequence Listing as primers gave an amplification product containing 
the ORF in the invertase gene and having restriction enzyme recognition sequences at both ends. Afterteiminal double 
restriction digestion with Afllll (New England Biolab) and Hindlll (Takara Shuzo Co., Ltd), the amplification product was 
subjected to agarose gel electrophoresis. Purification of the band of about 3000 bp by the glass beads method using 

55 EASY-TRAP gave a fragment for use as an insert. 

[0077] The S. pombe multlcloning vector pTL2M (JP-A-7-1 63373) was subjectedto agarose gel electrophoresis after 
terminal double restriction digestion with Spel and Ecol. Purification of the band of about 4500 bp by the glass beads 
method using EASY-TRAP gave a vector fragment. 



11 



BNSDOCID: <EP 1223219A2.I_> 



EP 1 223 219 A2 



[0078] The two fragments were ligated with a DNA ligation kit (Takara Shuzo Co., Ltd) and transfomned into E. coh 
strain DH5 (Toyobo Co.. Ltd.). E. coli colonies were screened for the invertase gene expression vector pRIOLPI through 
base sequencing and restriction mapping, and the vector was recovered nnultiplied by the alkali-SDS method on a 
preparatory scale. 

5 

EXAMPLE 18 

Construction of secretory expression vector pSL2l06a'c1 using the signal sequence from the invertase gene 

10 [0079] PCR using plasmid plNV3000 (Example 1) carrying the S. pombe invertase gene as the template and oligo 
DNAs shown in SEQ ID NOS: 18 and 19 in the Sequence Listing as primers gave an amplification product containing 
the ORF in the Invertase gene and having restriction enzyme recognition sequences at both ends. After temninal double 
restriction digestion with Spel (Takara Shuzo Co., Ltd.) and EcoRI (Takara Shuzo Co., Ltd.), the amplification product 
was subjected to agarose gel electrophoresis. Purifteation of the band of about 700 bp by the glass beads method 

15 using EASY-TRAP (Takara Shuzo Co., Ltd.) gave a fragment for use as a signal insert. 

[0080] PCR using plasmid pSL2P06a'c1 (WO96/23890) containing human iterleukin 6a*c1 variant cDNA as the tem- 
plate and ollgo DNAs shown in SEQ ID NOS: 20 and 21 in the Sequence Listing as primers gave an amplification 
product containing the iterleukin 6a'c1 variant ORF. After temrjinal double restriction digestion with EcoRI and Hindlll 
(Takara Shuzo Co., Ltd), the amplification product was subjected to agarose gel electrophoresis. Purification of the 

20 band of about 600 bp by the glass beads method using EASY-TRAP (Takara Shuzo Co.. Ltd.) gave a fragment for use 

as a gene insert. . . ^ • *+ 

[0081] The S. pombe multicloning vector pTL2M (JP-A-7-1 63373) was subjected to agarose gel electrophoresis after 
terminal double restriction digestion with Spel and Hindlll. Purification of the band of about 4500 bp by the glass beads 
method using EASY-TRAP gave a vector fragment. 
25 [0082] The three fragments were ligated with a DNA ligation kit and transformed intothe E. coli strain DH5 (Toyobo 
Co Ltd ). E. coli colonies were screened for the IL-6a'c1 secretory expression vector pSL2l06a'c1 through base se- 
quencing and restriction mapping, and the vector was recovered by the alkali-SDS method on a preparatory scale. 
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EXAMPLE 1 9 

Preparation of S. pombe transformant ASP168 



[0083] A leucine-requiring S. pombe strain ARC001 was transformed with the interleukin-6a'c1 variant secretory 
expression vector pSL2l06a'c1 (Example 18) and a transducing vector pAL7 as described by Okazaki et al. 

35 [0084] 1 mf of a preincubated ARC001 culture in YPD medium was incubated in 100 minimum medium MB+Leu 
with shaking at 30''C for 16 hours. The cells were collected, washed with water, suspended in 0.1 M lithium acetate 
(pH 5 0) at a cell density of 1 0^ ceHs/m€ and incubated at 30°C for 60 minutes. A 1 00 ^€ portion of the suspension was 
mixed with 2 jig of the recombinant vector pSL2l06a'c1 and 1 .0 jxg of Pstl-digested pAL7 in 1 5 ^€ TE and further wrth 
290 \ie of 50% PEG 4000 thoroughly and Incubated at SO'^C for 60 minutes, at 43"'C for 15 minutes and at room 

40 temperature for 10 minutes, successively. After centrifugal removal of PEG, the cells were suspended in 1 m€ of 1/2 
YEL+Leu medium After 1 0-f old dilution, 1 m€ of the suspension was incubated at 32°C for 2 hours, and a 300^€ portion 
was spread on minimum medium agar MM A. After 3 days of incubation at 32 «C, about 1000 independent colonies 
had developed on the plate. 

[0085] The transfomiants (colonies) were inoculated in 2 m€ of YEL medium containing 10 ^ig/mf antibiotic G418 
45 (YEL1 0 medium) and incubated with shaking at 32''C for 5 days. The viable clones of the putative desired transfomnant, 
designated as strain ASP168. were frozen in glycerol and stored for use in subsequent experiments. 

EXAMPLE 20 

50 Analysis of expressed secretory interieukin-6a'c1 variant in culture medium 

[0086] A fission yeast Schizosaccharomyces pombe transfomnant ASP168 (Example 19) was incubated in 
MA-Casamino acid medium (MA medium containing 2% Casamino acid and 3% glucose; for the composition of MA 
medium, referto "Alfa et al.. Experiments with Fission Yeast. Cold Spring HartDor Laboratoiy Press, 1993") containing 
55 500 \ig/rr}e G41 8 at 32°C for 2 days. 

[0087] The cell culture was centrifuged. and the supernatant was concentrated 1 00-fold through a membrane filter 
(Amicon Co.. Ltd.). Analysis of the concentrated sample by SDS-polyacrylamide electrophoresis followed by Coomas- 
sie Brilliant Blue staining gave the SDS-PAGE pattem shown in Figure 15. Lane 1 is the purified interleukin-6a'c1 
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variant, lane 2 is the supernatant from the ASP168 cell culture, and lane 3 is the supernatant from a cell culture of the 
control strain ASP021 [transformant carrying a recombinant vector with no ORF prepared by recombination-of-pTL2M 
(JP-A-7-163373) by the method disclosed in JP-A-7-163373 without introduction of any gene to be expressed]. The 
band with a molecular weight of about 20000 in lane 3 seemed attributable to the interleul<ln-6a'c1 variant from the 
5 comparison of lanes 1 and 3. 

[0088] Further analysis by western blotting using an antl-IL-6a'c1 gave the pattern shown In Figure 1 6. Lane 1 is the 
purified interleukln-6a'c1 variant, lane 2 is the supernatant from the ASP168 cell culture, and lane 3 is the supernatant 
from a cell culture of the control strain ASP02 1 . The band with a molecular weight of about 20000 in lane 3 was identified 
as the interleulcin-6a'c1 variant from the comparison of lanes 1 and 3. 
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SEQUINCt LISTING 

<1I0>' Asahi Glass Co.. Led.- , • i 

<120> Inducible promocer gene and secrecion signal gene luncvional in 
Schizosaccharofflvces pombe. expression vector coacainmg theo and their use 
<I30> AB23(F9865) . 
<150> JP9/31460S 
<151> 1997-10-31 
<160> 21 
<170> MS-DOS 
<210> 1 
<211> 474S 
<212> DNA 

<213> Schizosaccharomyces pombe 

ggauctagt ccgcgaaatc gagatgcttt gaagattaaa attaaattca attttatgcg 60 



10 



20 



45 



SO 



120 

180 
240 
300 
360 
420 



agactggttt ccttattttt tgtatagtcg catgcaagcg aggttcgcat aatttggaaa 
ataaaggtag tcaagaagac gttgaattaa ggctgcagit tcaaagtact ctacaaacga 
ttccttttaa aaaaaaagat tcaaaaaaaa ggcaaagggc ttaagtaatg cttgttattt 
caattta.cct ccaaacagtt actaatgcaa ttgcaaaaaa aaaacctacc tattgaacca 
25 aaatttctag cccatccatc gctcctcaag ataaaggaat cgatattttg agtttaaggg 

agtcgctgac agatttcaga attaaaaatt tttggaaaag galgtcgaga acaagaagac 
* acgtctagat tgctgatgat gcattctagc agacggaaat acaacgatat gtggacagca 480 

30 

cgacttttga tccgttcgga tcaaaaggaa gagaaatatc catctttcaa gaagaatgca 
ggaaaagcaa taaatgccca tttgattcct aaattatccc caaaaatgaa cattatgaga 
tccccUgtg ggagacagga aatttcgcaa ttccaaacga aaactcggct cttttttua 
ccccacagtt gcggggtaaa tgatgtaacg gaccttgggg gaaaggatga tgagttagtc 
gggaagcgga aaaaatggaa aacggaagta agaatagaaa ccagtatggc tgagtgcaat 
40 ggcggaaaag attttacaga gatgacaaga atctatttat c ta taaggaa aaact t t t tc 

caaatttgtc taaaaacgca ttctcctcaa ttgcctctag gtagatgata taacgaattg 
gaacgagaca tcgctaaccg gttttctttg taaatgacac tttgtagtgg gagtaagtct 
gaatggaggg atagacagat gaatagtatg agatagaaga atagtatata taatgatLaa 
gatgaacaaa taaaaattga aagaaaaaag aaatcgttgg ctcatttggt tcatacacac 
gttggttcat acaactttta cccatcgtaa gtattataag caaaaaatag agtacgaaaa 
gctataagta gtgaagcaaa aaaatagaaa aatagaaaaz aaaatatata taaaaaaaca 
taataaaaat aaaactcata agagacgtaa aacacaagaa ttgictatca tttgttctsc 



540 
600 
660 

720 
780 
840 
900 
960 
1020 
lOSO 
1140 
1200 
1260 



55 
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aagaa'gcacc" accac cccgc aaaacccttc a::tctcatc agcaaggacc ctlcTcaLtc 1320 

ciccctcctt agaatccttc ccactacaac gaaccggata acacgcaaat aagaacacac 13S0 

cccccaaata cgatatat.cg atccattttt tactttgccc agctcactgc tgtacaactc 1440 

catctaaaca gtttcccctc aagaaagatc gccaatggag gcgacaacat accggaattc 1500 

aagctgcgga cacagagcct gaaaagactg cattttgcat tgcctlcaag taaatgaaac 1560 

tgagttttga agtctcaaaa cacatcttat gtattgaaca ttagaagaac acataagata 1620 

gaccttgaga gctcaattca tcgacattct agccatcata ctgcgaccct agacactgtc 16S0 

agcacaacct tagatcgaaa atgaacacgt taccaaacgt tgtctaaaac ttgccgaatc 1740 

ttatctccgc attacttccg taatccttag tacatacgct gcaatttcgg aaggtcatga 1800 

tcgacttttt gtgtagctat aagtgacgca aatgagaaac atgacaaggt gcgatattta 1860 

gcaagatatt atgcatttga tggagaaagg aaatttcgga tgtatatata gtaccgttag 1920 

ctgcgctttt tttggtcatc cataattttc aaactcactg ctttcgatca gatttaccgt 1980 

ttttaaggtc tttattgctt tgtgatctgt aggttggaac atctatagtt catttcctaa 2040 

aagatcctct catcgtctca tcggatagta atcgttcaag aaaaaaaaag aaaaaaagaa 2100 

aaagaaaaag aaaagaaaaa taaaccgcta taattcatta cccatttgac tgaaggttct 2160 

tcatcttgaa ttgttttgaa tcaaaataaa gaaattatta ctattatttt ttttcttcgc 2220 

tttttcttta tccattcgtc gaaactattt ttctgctgat aaaagcaatc attccttttt 2280 

cctgcttctc ttgttactcg aattttaaac gacttttttt cctcgtccat tccctaattc 2340 

tttgcgacct tttctgattc tatccttggt tigtactttc gttgtgtaat tgttgagaaa 2400 

gtgaactgat tatttaattg ctgtgaaaaa aattctaaaa ctattttgtt Cttctcgatc 2460 

attcatcctt tgctcgctcg ctcgaatatt acagaaattc gt c tccc t t t' caacggaata 2520 

tgataatttg ttgaatactc taaatcaatt aacacctatc aaaagctgaa acattaaatc 2580 

tattctcacc aaaaaaaaag actcaagctt cttcgttgtt ggccggtctc ttttttgttt 2640 

lacgattgu aaattt-ata ctcacaactg ccaattctcc acttctgact attcattgat 2700 

agtccctatt taattttctg ttcaccgatt atcgtctttt ttgcaaataa tctctcttgg 2760 

aaccaaccaa ttaatacgtt ataatcgcta accttgaaga c t tgctaca- acg ttt ttg 2S1S 

Met Phe Leu 
1 

aaa tat att tta get agt ggc att tgc etc gee ict etc tta tea tct 2866 
Lys Tyr Me Leu Ala Ser Gly lie Cys Leu Val Ser Leu Leu Ser Ser 
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10 



2914 



10 



15 



20 



25 



30 



50 



55 



aca aac gcg get ccc cgi cac tea cat gca aaa eg; cat cct gcc a c 
Thr Asn Ala Ala Pro Arg His Leu Tyr Yal Lys Arg Tyr Pro Val He 
20 . 25 30 35 

tat aat get tec aac acc act gaa gcc age aat tec acc ace get cct 2962 
Tyr Asn Ala Set Asn He Thr Clu Val Ser Asn Ser Thr ihr Val Pro 
40 45 SO 

cct cct cca etc gta aat aca acg gcc cct aat ggg act tgt ttg ggt 3010 
Pro Pro Pro Phe Val Asn Thr Thr Ala Pro Asn Cly Thr Cys Leu Cly 
55 60 65 

aac tat aac gag tat ctt cct tea gga tat tac aat get acc gat cgt 305S 
Asn Tyr Asn GIu Tyr Leu Pro Ser Gly Tyr Tyr Asn Ala Thr Asp Arg 
70 75 SO 

ccc aaa att cat ttt acc cct tec tec ggt tec aCg aat gat cca aac 3106 
Pro Lys He His Phe Thr Pro Ser Ser Gly Phe Met Asn Asp Pro Asn 
85 90 95 

3154 



aaa tat tea gga aat ccc gtt ctt gae att aat tea etc caa tec cge 

Lys Tyr Ser Cly Asn Pro Val Leu Asp He Asn Ser Leu Gin Phe Arg 

215 220 225 

gac ccc aag gta ata tgg gat etc gat get aat cgt tgg gtg atg att 

Asp Pro Lys Val He Trp Asp Phe Asp Ala Asn Arg Trp Val Mee He 



gga ttg gta tat act ggc ggc gtc tat cac atg tec ttc caa tat tea 
Cly Leu Val Tyr Thr Gly Gly Val Tyr His Met Phe Phe Gin Tyr Ser 
100 105 110 11^ 

cca aaa act eta aca gcc ggc gaa gte cat tgg ggc cac aca gtt tec 
Pro Lys Thr Leu Thr Ala Cly Glu Val His Trp Gly His Thr Val Ser 
120 125 130 

aag gat tta ate cat tgg gag aat eat cct att gcc ate tac ccc gat 
Lys Asp Leu He His Trp Glu Asn Tyr Pro He Ala He Tyr. Pro Asp 
135 l*iO 145 

gaa cat gaa aac gga gtt ttg tec etc cca ttt agt ggc agt gca gtc 
35 Glu His Glu Asn Gly Yal Leu Ser Leu Pro Phe Ser Cly Ser Ala Yal 

150 155 160 

gte gat gtt cat aat tet tec ggt etc ttt tec aac gac acc att ccg 
Val Asp Val His Asn" Ser Ser Gly Leu Phe Ser Asn Asp Thr He Pro 
40 165 170 175 

gaa gag cge att gtt tta att cae acc gat cat tgg act ggt gtt set 
Glu Glu Arg He Val Leu He Tyr Thr Asp His Trp Thr Cly Val Ala 
ISO 185 190 • 195 

gag cgt cag get att gcg tat acc act gat ggt gga tat act ttc aaa 3442 
Glu Arg Gin Ala He Ala Tyr Thr Thr Asp Cly Cly Tyr Thr Phe Lys 
200 205 210 



3202 



3250 



329S 



3346 



3394 



3490 



3538 



16 



10 



15 



20 



35 



45 



50 



55 
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230 235 - ■ 240 

gta get acg tcl caa aat cac gga ate gcc tct tat tec ccc tat gac 35S6 
Val Ala Met Ser CIn Asn Tyr Cly He Ala Phe Tyr Ser Ser Tyr Asp 
245 250 255 

ttg att cac tgg acc gag tta cct get ttc tec act tct ggt tat ttg 3634 
Leu He His Trp Thr GIu Leu Ser Val Phe Ser Thr Ser Cly Tyr Leu 
260 265 270 275 

ggg ttg caa tat gaa tgc cct gga atg get cgt gtg ccc gtt gaa ggc 36S2 
Cly Leu Gin Tyr Clu Cys Pro Cly Met Ala Arg Val Pro Val Glu GIv 
280 285 290 

acc gat gaa tac aaa tgg gta etc ttc at etc catc aat cct ggc get 3730 
Thr Asp Glu Tyr Lys Trp Val Leu Phe He Ser lie Asn Pro Gly Ala 
295 300 305 

cca ttg gga gga tec gtt gtc caa tac ttt gtt ggc gat tgg aat ggt 3778 
Pro Leu Gly Gly Ser Va! Val Gin Tyr Phe Val Gly Asp Trp Asn Gly 
310 315 320 



ac4 aac ttc gtc ccc gat gat ggc caa act aga ttc gta gac ttg ggt 3826 

Thr Asn Phe Val Pro Asp Asp Cly Gin Thr Arg Phe Yal Asp Leu Gly 
2S 325 330 335 

aag gac ttt tac gcc age get ttg tat cac teg tct tec gcc aat gcc 3S74 

Lys Asp Phe Tyr Ala Ser Ala Leu Tyr His Ser Ser Ser Ala Asn Ala 
340 345 350 355 

30 

gat gtt att gga gtt gga tgg get age aac tgg caa tac acc aac caa 3922 

Asp Val He Cly Val Gly Trp Ala Ser Asn Trp Cln Tyr Thr Asn Gin 

360 365 370 

get cct act caa gtt ttc egc agt get atg aca gtt gca cga aaa ttc 3970 

Ala Pro Thr Cln Val Phe Arg Ser Ala Met Thr Val Ala Arg Lys Phe 
375 380 . 385 

act ctt egc gac gtt cct eag aac ccc atg ace aac ett act tct etc 4018 

Thr Leu Arg Asp Val Pro Gin Asn Pro Met Thr Asn Leu Thr Ser Leu 

390 395 400 • 

att caa acc cca ttg aat gtt tct etc tta cga gat gaa aca eta ttt 4066 

He Gin Thr Pro Leu Asn Val Ser Leu Leu Arg Asp Clu Thr Leu Phe 
405 410 415 



acc gca ccc gtt ate aat agt tea agt agt ett teg ggc tet ccg att 4114 

Thr Ala Pro Val He Asn Ser Ser Ser Ser Leu Ser Cly Ser Pro He 

420 425 430 435 

act ctt cca age aat acc gca ttc gag ttc aat Qtc aca etc agt ate 4162 

Thr Leu Pro Ser Asn Thr Ala Phe Clu Phe Asn Val Thr Leu Ser He 

440 445 450 

aat tac aca gaa ggc tgc aca aca gga tat tgt ctg ggg cgt att ate 4210 

Asn Tyr Thr Clu Cly Cys Thr Thr Cly Tyr Cys Leu Cly Arg He He 



17 



BNSDOCtD: <EP 



EP1 223 219 A2 



455 



460 



465 



10 



15 



20 



25 



30 



35 



act gat tec gat gat cca cac aga tea caa tec ate tec gcg gac gtt 4255 

rie Asp Ser Asp Asp Pro Tyr Arg Leu CIn Ser He Ser Val Asp Val 

470 ^75 480 

gac etc gca get age acc tta gee act aat cgt gee aaa gcc cag atg 4306 

Asp Phe Ala Ala Ser Thr Leu Va! lie Asn Arg Ala Lys Ala Gin Met 

485 490 495 

gga tgg ttt aat tea cCt etc acg cct tec ttc gcc aac gat act cac 4354 

Gly Trp Phe Asn Ser Leu Phe Thr Pro Ser Phe Ala Asn Asp lie Tyr 
500 505 510 515 

att tac gga aac gta act ttg tat ggt ate gtt gac aac gga ttg cct 4402 

lie Tyr Cly Asn Val Thr Leu Tyr Cly He Val Asp Asn Gly Leu Leu 
520 525 530 

gaa ecg tat gtc aat aat ggc gaa aaa act tac act aat gac ttt ttc 4450 

Clu Leu Tyr Val Asn Asn Gly Glu Lys Thr Tyr Thr Asn Asp Phe Phe 
535 540 545 

ttc ett caa gga gca aca cct gga cag ate age ttc get get ttc caa 4498 

Phe Leu Gin Cly Ala Thr Pro Gly Gin He Ser Phe Ala Ala Phe Gin 

550 555 560 

ggc gtt tct ttc aat aat gtt ace gtt aeg cca tta aag act ate tgg 4546 

Gly Val Set Phe Asn Asn Val Thr Val Thr Pro Leu Lys Thr He Trp 

565 570 575 

aat tgc taaatatttt gtttcaagtt aggaaagtat aataacttct gtccctgeat 4602 

Asn Cys 

5S0 

attcaattgt aaagtttagt ttatcelttc atcgtaacea caattgtcae ctaaatetet 4662 
aaaaatctct tcacttatct agttaatgtc gtaacaaaaa agtccagtag cttcgggaaa 4722 
tgatgcctgg aatcatacaa gtcgac 474S 



40 



45 



so 



<210> 2 
<211> 7286 
<212> DNA 

<213> Artificial Sequence 
<400> 2 

agcttgaaaa aacctcccac acctccccct gaacctgaaa cataaaatga atgcaattgc 60 
tgttgtcaac tcgtltatcg cagcttataa tggttacaaa caaagcaata gcatcacaaa J20 
ttteaeaaat aaagcatttt tttcaetgca ttetagttgt ggtttgtcea aaetcateaa ISO 
tgtatcttat eaigtctgga tcgatcccgg caggtcgggc gtcgcttggt eggteattce 240 
gaaceccaga gtcccgetea gaagaactcg tcaagaagge gacagaaggc gatgcgetge 300 



55 
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2a2ics2gag cggcgatacc gtaaagcicg'aggaagcggt cagcccactc gccgccaagc 360 

cccicagcaa catcacgggc agccaacgct acgccccgat agcggcccgc cacacccagc 420 

cggccacagc cgatgaatcc agaaaagcgg ccatcctcca ccatgatact cggcaagcag 4S0 

gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt gagcctggcg 540 

aacagctcgg ctggcgcgag cccctgatgc tcttcgccca gatcatcctg atcgacaaga 600 

ccggcctcca tccgagtacg tgctcgctcg atgcgatgtt icgcttggCg gtcgaatggg 660 

caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat ggatactttc 720 

tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc caatagcagc 780 

cagicccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gcccgtcgtg S40 

gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc ggacaggtcg 900 

gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc ggcatcagag 960 

cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca agcggccgga 1020 

gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atccccatcc tgtctcttga lOSO 

tcagatccgg gacctgaaat aaaagacaaa aagactaaac ttaccagtta actttctggt 1140 

ttttcagttc ctcgaggagc tttttgcaaa agcctaggcc tccaaaaaag cctcctcact 1200 

acttctggaa tagctcagag gccgaggcgg cctcggcctc tgcataaata aaaaaaatta 1260 

gtcagccatg gggcggagaa tgggcggaac tgggcggagt taggggcggg atgggcggag 1320 

ttaggggcgg gactatggtc gctgactaat tgagatgcat gctttgcata cttctgcctg 13S0 

ctggggagcc tggggacttt ccacacctgg ttgctgacta attgagatgc atgctttgca 1440 

tacttctgcc tgctggggag cctggggact ttccacaccc taactgacac acattccaca 1500 

ggacattgai tattgactag ttagtccgcg aaatcgagat gctttgaaga tlaaaattaa 1560 

attcaattct atgcgagact ggtttcctta ttttttgtat agtcgcatgc aagcgaggtt 1620 

cgcataatct ggaaaataaa ggtagtcaag aagacgttga attaaggctg cagtttcaaa 16S0 

gtactctaca aacgatlcct tttaaaaaaa aagatccaaa aaaaaggcaa agggtttaag 1740 

taatgcttgt tatttcaatt tacctccaaa cagttactaa tgcaatcgca aaaaaaaaac 1800 

ctacctattg aatcaaaatt tctagcccat ccatcgctcc tcaagataaa ggaaccgaca. 1S60 

ttttgagtct aagggagttg ctgatagatt icagaattaa aaatttttgg aaaaggatgt 1920 

cgagaacaag aagatacgtc tagattgctg atgatgcatt ccagcagacg gaaatacaac 19S0 
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c agcacgac t 


1 1 tgatccgu 


ccggatcaaa 


aggaagagaa 


acatccatcc 


2040 








agcaacaaac 


gcccatt cga 


t ccctaaac t 


atccccaaaa 


2100 


5 


a I gaaca eta 


eaoazitrTtC 




caggaaa 1 1 1 


cgcaat tcca aacgaaaat t 


2160 




cggc iCC tec 


I L t t aCCCC d 


r via t t ocouxf 


9 1 aaa t ca 1 6 


taacggacct 


tgggggaaag 


2220 


10 


gatgatgagt 


tagt tgggaa 


gCggaaaaad 




aagcaagaat 


agaaaccagt 


2280 




atggccgagC 


gcaatggcgg 


aaaagat 1 1 1 


acagaga i ga 


caagaatc ta 




2340 




aggaaaaact 


t tt tccaaat 


1 1 g t c t aaaa 


acgcat tctc 


ctcaat cgcc 




2400 


15 


tgatataacg 


aat tggaacg 


agaca tcgc t 


aaccggtt tt 


ctt tgtaaat 


gacac I L LgL 


2460 




agtgggagta 


agt t tgaatg 


gagggataga 


caga tgaata 


gtatgagata 


gaagaaiagL 


2520 


20 


atatacaatg 


attaagatga 


acaaataaaa 


at tgaaagaa 


aaaagaaat t 


gt tggc tea t 


2580 


ttggtccata cacatgttgg 


cccacacaac 


ttttacccat cgtaagtatt 


ataagtaaaa 






aatagagtac 


gaaaagctat 


aagtagtgaa 


gcaaaaaaat 


agaaaaatag 


aaaaaaaaac 


2700 


25 


atatacaaaa 


aaatataaca 


aaaataaaac 


tcataagaga 


cgtaaaacac 


aagaa t tgtc 


2760 




tatcatt tgt 


tctt taagaa 


gcaccaccac 


tctgtaaaac 


tcttcat ttc 


teat tagcaa 


2S20 




ggacccttt t 


cat tccttcc 


tct ttagaat 


ccttttcatt 


ataacgaat t 


ggataacacg 


28S0 


30 


caaat aagaa 


cacatcccct 


aaatacga ta 


tatcgatcca 


ttttt tactt 


tgcctagct t 


2940 




at tgctgtac 


aattccattt 


aaatagt t tc 


tec tcaagaa 


agatcgtcaa 


tggaggcgac 


3000 


35 


aatataccgg 


aatt taagct 


gcggacacag 


agct tgaaaa 


gactgcatt t 


tgta t tgt 1 1 


3060 


tcaagtaaat 


gaaactgagt 


tttgaagtct 


caaaatacat 


ct tatgtatt 


gaaca 1 1 aga 






agaacatata 


agatagatct 


tgagagctca 


at tcatcgac 


attctagcca 


tea t ac t gcg 


3 ISO 


40 


ate t tagaca 


t tgtcagcac 


aacc t tagat 


cgaaaa tgaa 


cacgt tacca 


aacgt tgtet 


3240 




aaaac t tgcc 


gaatct tacc 


tccgcat tac 


t tccgtaatc 


ct tagtacat 


acgctgcaat 


3300 




t tcggaaggt 


catgatcgac 


t ttctgtgta 


gctataagtg 


acgcaaatga 


gaaacatgac 


3360 


43 


aaggtgcga t 


at t tagcaag 


a tat tatgca 


1 1 tgatggag 


aaaggaaat t 


tcggatgtat 


3420 




atatagcacc 


gttagctgcg 


ctttttttgg 


tea tcca taa 


ttttcaaact 


eaetgctt te 


3480 


50 


gaccagat 1 1 


accgttttta 


aggtctttat 


tgctt tgtga 


tctgtaggt t 


ggaaca t c t a 


3540 


tagttcattt 


tctaaaagat 


cc 1 1 tcatcg 


; tttcatcgga 


tagtaaccgt 


tcaagaaaaa 


3600 




aaaagaaaaa 


aagaaaaaga 


aaaagaaaae 


[ aaaaataaac 


cgctataatt 


cat tacccat 


3660 
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ttgactgaag gt tct tcacT : tgaat tgJ t ttgaatcaaa ataaagaaat tattatcatt ^720 
•attttttttc ttcgctcttt cvttacccat tcgccgaaac tattttcctg ctgataaaag 3730 
caatcattcc tttttcctgc ctctcttgtt attcgaattt caaacgactt tctcicctcg 3S40 
tccatccccc aattccttgc gacctttccc gatcctatcc ttggttcgca ctttcgttgt 3900 
gtaattgttg agaaagtgaa ctgattattt aattgttgcg aaaaaaattc taaaactatt 3960 
ttgtmtct tgatcattca tcctttgctc gcttgcttga atattacaga aattcgtctc 4020 
cctttcaacg gaatatgata atttgtigaa tactctaaat caattaacac ctaccaaaag 40S0 
ctgaaacatt aaatctattc tcaccaaaaa aaaagacica agcttcttcg ttgttggccg 4140 
gtctcttttt tgttttacga ttgttaaatt ttatactcac aactgccaac tctccactct 4200 
tgactattta ttgatagtcc ctatttaatt ttctgttcac cgattatcgt cttttttgta 4260 
aataatcttt cttggaacca accaattaat acgttataat cgctaacttt gaagatttgc 4320 
tacaatggca atggtatcag aattcgagct cggtacccgg ggatcctcta gagtcgacct 4380 
ecaggcatgc aagcttaaat aggaaagttt cttcaacagg attacagtgt agctacctac 4440 
atgctgaaaa atatagcctt taaatcattt ttatattata actctgtata atagagataa 4500 
gtccattttt taaaaatgtt ttccccaaac cataaaaccc tatacaagtt gttctagtaa 4360 
caatacatga gaaagatgtc tatgtagctg aaaataaaat gacgtcacaa gacaaaaaaa 4620 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaagta 4680 
ccttctgagg cggaaagaac cagccggatc cagacatgac aagatacatt gatgagtttg 4740 
gacaaaccac aactagaatg cagtgaaaaa aatgctttat ttgtgaaatt tgtgatgcta 4800 
ttgctttatt tgtaaccatt ataagctgca ataaacaagt taacaacaac aattgcattc 4860 
attttatgtt tcaggttcag ggggaggtgt gggaggtttt ttaaagcaag taaaacctcc 4920 
acaaatgtgg tatggctgat tatgaUcgg ctgcctcgcg cgtttcggtg atgacggtga 4980 
aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg 5040 
gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gcgcagccat 5100 
gacccagtca cgtagcgata gcggagtgta tactggctta actatgcggc atcagagcag 5160 
atlgtactga gagcgcacca tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa 5220 
taccgcatca ggcgctcttc cgcttcctcg ctcactgacc cgctgcgctc ggtcgttcgg 5250 
ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 5340 
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gataacgcag gaaagaatga gcaaaaggcc agcaaaaggc caggaaccgc aaaaaggccg 5^00 

cgccgctggc gttltcccat aggctccgcc cccctgacga gcaccacaaa aaccgacgct 5460 

caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 5520 

gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc S580 

tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 5640 

aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 5700 

ccttatccgg taactatcgt cctgagtcca acccggtaag acacgactta tcgccactgg 5760 

cagcagccac tggtaacagg atcagcagag cgaggtatgt aggcggtgct acagagttct 5S20 

tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgcCctgc 5880 

tgaagccagt taccttcgga aaaagagitg gtagctcttg atccggcaaa caaaccaccg 5940 

ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 6000 

aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 6060 

aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 6120 

aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 6180 

gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 6240 

gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 6300 

caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 6360 

ccggaagggc cgagcgcaga agtggtcctg caacucatc cgcctccaic cagtctatta 6420 

attgttgccg ggaagctaga gtaagtagtt cgccagtiaa tagtttgcgc aacgttgttg 6480 

ccattgctgc aggcatcglg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 6540 

gttcccaacg atcaaggcga gttacatgat cccccatgtl gtgcaaaaaa gcggttagct 6600 

ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 6660 

tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttc tctgtgactg 6720 

gtgagtactc aaccaagica ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 6780 

cggcgtcaac acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 6S40 

gaaaacgttc ttcggggcga aaactctcaa ggatcctacc gctgttgaga tccagttcga 6900 

tgtaacccac tcgtgcaccc aactgatctt cagcatcttt cactttcacc agcgtttctg 6960 

ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat /020 
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gicgaacacc catacccctc cctccccaac atcatcgaag cacccacca^ gg'ccaccgtc 7080 

ccacgagcgg acacacacic gaacgtaccc agaaaaataa acaaacaggg gttccgcgca 7140 

cacttccccg aaaagtgcca cctgacgtct aagaaaccac tattatcatg acaccaacct 7200 

ataaaaatag gcgtatcacg aggccccttc gtccccaaga atcggccgac caattctcat 7260 

gtttgacagc ttatcatcga taagcc 7286 



<210> 3 
<211> 27 
?5 <2i2> DNA 



<2i3> Artificial Sequence 
<400> 3 

gtttgaattc atgaaygayg gnaaygg 27 

<210> 4 
<2il> 24 
<2i2> DNA 

<213> Artificial Sequence 
<400> 4 

gtttgaattc yrytggggnc aygc 24 



• <210> 5 

<211> 27 
<212> DNA 

<213> Artificial Sequence 
<400> 5 

gtttgaattc acyttnggrt cncgraa 27 



<210> 6 
<211> 24 
<212> DNA 

<2]3> Artificial Sequence 
<400> 6 

gatgggtaaa actagtatga acca 24 



<210> 7 
<21I> 23 
<212> DNA 



<213> Artificial Sequence 
<400> 7 

ttgcgaatta ctagtctccc aca 23 



<2i0> S 

<211> 29 

<212> DNA 

<213> Artificial Sequence 

<400> S 



23 
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gSgactasti agcccgcgaa accgagacg 



29 



5 <210> 9 

<211> 42 
<212> DMA 

<213> Artificial Sequence 
<400> 9 

10 gaggaattct gataccatcg ccactgtagc aaaccticaa ag 42 

<210> 10 
<211> 30 
15 <212> DMA 

<213> Artificial Sequence 
<:400> 10 

aaagctagcc attgtagcaa atcttcaaag 30 

<2iO> 11 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<400> 11 

atggctagca agggcgagga gctgccc 27 

<210> 12 
<211> 27 
30 <212> DNA 

<213> Artificial Sequence 
<400> 12 

cccaagctta cttgtacagc tcgCcca 27 



<210> 13 
<211> 7938 
<212> DNA 

<213> Artificial Sequence 
<400> 13 

40 agcttgaaaa aacctcccac acctccccct gaacctgaaa cataaaatga atgcaactgt 60 

tgctgttaac ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa 120 
tttcacaaat aaagcacttt ttccactgca ctctagttgt ggtttgtcca aactcatcaa 180 
tgtatcttat catgtctgga tcgatcccgg caggttgggc gtcgcttggt cggtcatttc 240 
gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gaiagaaggc gatgcgccgc 300 
gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccactc gccgccaagc 360 
ccttcagcaa tatcacgggt agccaacgct ztgtcccgat agcggtccgc cacacccagc 420 
cggccacagt cgatgaatcc agaaaagcgg ccattctcca ccatgatatc cggcaagcag 4S0 

55 



45 



SO 



24 
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10 



30 



40 



gcaicgccac 


gggtcacgac 


gagatcctcg 


ccgtcgggca" 


tgcgcgcci t 


gagce tggeg 


540 


aacagc t egg 


ccggcgcgag 


cccccgacgc 


tec tcgccca 


gatcacectg 


atcgacaaga 


600 


ccggct tcca 


tccgagtacg 


tgc tcgc ccg 


acgcgatgct 


tcgc t cggtg 


gtcgaacggg 


660 


caggCagccg 


gatcaagcgt 


atgcagccgc 


cgcac tgcat 


cagccatgac 


ggacact t tc 


720 


tcggcaggag 


caaggCgaga 


tgacaggaga 


tec tgccccg 


gcac t ccgcc 


caatagcagc 


780 


cagtccc tec 


ccgct tcagt 


gacaacgccg 


agcacagetg 


cgeaaggaac 


gcccgtcgtg 


S40 


gccagccacg 


acagccgcgc 


tgcctcgtcc 


tgcagt teat 


ccagggcacc 


ggacaggtcg 


900 


gtcttgacaa 


aaagaaccgg 


gcgcccctgc 


gccgacagcc 


ggaaeacggc 


ggcatcagag 


960 


cagccga ttg 


tctgt tgtgc 


ccagtcatag 


ccgaat agcc 


tc tccaccca 


agcggccgga 


1020 


gaacctgcgt 


gcaa tcca tc 


t tgt tcaatc 


atgcgaaacg 


atce teatcc 


tgtctcttga 


1080 


tcagatccgg 


gacctgaaat 


aaaagacaaa 


aagactaaac 


t taccagtta 


actttctggt 


1140 


1 1 1 tcagttc 




ttcttgcaaa 


agcc taggcc 


t ecaaaaaag 


cctcc tcact 


1200 


ac 1 1 c t ggaa 


t age tcagag 




ec t cggectc 


tgca t aaa ta 


aaaaaaat ca 


1260 


gtcagccatg 


gzscgeagaa 




tg?ecg£aii t 


*• O O & ft o 


a t E^gficczaK 


1320 


1 1 aggggCEg 


gac t a tggt t 


gctgactaa t 


tgaga tgcat 


ge 1 1 1 gca t a 


e t tc tgcc tg 


1380 


ct ggggagcc 


tggggactt C 


ccacacc tgg 


t tgctgacta 


a t tgaga tgc 


a t gc 1 1 tgca 


1440 


tac t Cc t gcc 


tgctggggag 


cc tggggact 


t cccacaccc 


t aac t gacac 


aca t tccaca 


1500 


ggaca 1 1 ga t 


tat tgactag 


t tagtccgcg 


aaa tcgagat 


gc 1 1 1 gaaga 


1 1 aaaa 1 1 aa 


1560 


atttaat ttt 


atgcgagac t 


ggtttcctta 


ttt tttgtat 


agt cgca tgc 


aaeceagg 1 1 


1620 


cgca taa ttt 


ggaaaataaa ggtagtcaag aagacgttga 


a 1 1 aagge tg 


cagt t tcaaa 


1680 


gt ac tc t aca 


aacgat tec t 


1 1 taaaaaaa 


aagat tcaaa 


aaaaaggcaa 


aeeg 1 1 taag 


1740 


taa tgc t tg t 


tatt tcaatt 


tacc tccaaa 


cagt tac taa 


tgcaa t tgca 


aaaaaaaaac 


1800 


ctaccta t tg 


aatcaaaa 1 1 


tc tagcccat 


eca tcgc tec 


t caaga taaa 


ggaa t cga t a 


1860 


1 1 1 tgagt 1 1 


aagggagt tg 


c tgatagat t 


ccagaat taa 


aaat 1 1 1 tgg 


aaaaggatgt 


19 20 


cgagaacaag 


aagatacgtc 


taga t tgc tg 


a cgatgca t c 


c tagcagacg 


gaaatacaac 


1980 


gacatgtgga 


cagcacgac t 


t ttgatccgt 


ccgga tcaaa 


Aggaagagaa 


a tatccatct 


2040 


t tcaagaaga 


atgcaggaaa 


agcaa taaa t 


gcccat C Cga 


t tcctaaat t 


a tceceaaaa 


2100 


atgaacat ta 


tgagatc t tc 


t tgtgggaga 


caggaaat 1 1 


cgcaa t tcca 


aacgaaaa 1 1 


2160 
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10 



15 



20 



atatataatg attaagatga acaaataaaa actgaaagaa aaaagaaatt gctggctcat 
ttggttcata cacatgttgg ttcatacaac ttttacccat cgtaagtatt ataagtaaaa 
aatagagtac gaaaagctat aagtagtgaa gcaaaaaaat agaaaaatag aaaaaaaaat 
atatataaaa aaatataata aaaataaaac tcataagaga cgtaaaacac aagaatcgtc 
tatcatttgt tctttaagaa gcaccaccat tctgtaaaac tcttcatttc tcattagcaa 
ggaccctttt cattccttcc tctttagaat ccttttcatt ataacgaatt ggataatacg 
caaataagaa cacatcccct aaatacgata tatcgatcca ttttttactt tgcctagctt 
attgctgtac aattccattt aaatagtttc tcctcaagaa agatcgtcaa tggaggcgac 
aatataccgg aatttaagtt gcggacacag agcttgaaaa gactgcattt tgtattgttt 
tcaagtaaat gaaactgagt tttgaagtct caaaatacat cttatgtatt gaacatcaga 
agaacatata agatagatct tgagagctca attcatcgac attctagcca tcatactgcg 
atcttagaca ttgtcagcac aaccttagat cgaaaatgaa cacgttacca aacgttgtct 
aaaacttgcc gaatcttatc tccgcattac ttccgtaatc cttagtacat acgctgcaat 
ttcggaaggt catgatcgac tttttgtgta gctataagtg acgcaaatga gaaacatgac 
■^0 aaggtgcgat atttagcaag atattatgca cttgatggag aaaggaaatf tcggatgt-at 

atacagtacc gttagctgcg ctttttttgg tcatccataa ttttcaaacl cactgcttic 
gatcagattt accgttttta aggtctttat tgctttgtga tctgtaggtl ggaacatcta 
tagttcattt tctaaaagat cctttcatcg tttcatcgga tagtaatcgl tcaagaaaaa 
aaaagaaaaa aagaaaaaga aaaagaaaag aaaaataaac cgctataatt cactacctat 
Ugactgaag gUcttcatc ttgaattgU ctgaatcaaa ataaagaaat tattattatt 
attttttttc ttcgcttttt ctttatccat ccgtcgaaac tatttttctg ctgataaaag 
caatcattcc tttttcctgc ttctcctgtt aitcgaattt laaacgactt tttttccicg 
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cggctcttcc tcttacccca cagctgcggg gtaaacgatg laacggacct igggggaaag 
gatgacgagt cagicgggaa gcggaaaaaa iggaaaacgg aagtaagaac agaaaccagt 
acggctgagt gcaatggcgg aaaagattct acagagatga caagaaccta tttacctata 
aggaaaaact ttttccaaat ttgtctaaaa acgcattctc ctcaattgcc tctaggtaga 
tgatataacg aattggaacg agacatcgct aaccggtttt ctttgtaaat gacattttgt 
agtgggagta agtttgaatg gagggataga cagatgaata gtatgagata gaagaacagt 2520 

2580 
2640 
2700 
2760 
2S20 
2S80 
2940 
3000 
3060 
3120 
3150 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
37S0 
3S40 
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tccac cccct 


aat cct c cgc 


gacctlitct 


gat cccaccc 


ttggct cgca 


cctccgtcgt 


3900 




5 


gcaaccgc eg 


agaaagtgaa 


ctgat cacc c 


aaccgctgcg 


aaaaaaac cc 


taaaaccac c 


3960 






ttgt ct ttcc 


tgatcat tea 


tcct t tgctc 


gcccgct cga 


acat cacaga 


aat tcgtctc 


4020 






cctt tcaacg 


gaatatgata 


ac t tgc tgaa 


tactccaaat 


. caac caacac 


c t a tcaaaag 


4080 




10 


ctgaaacac c 


aaatctac tc 


t caccaaaaa 


aaaagactca 


agcttcctcg 


t tgt tggccg 


4140 






gtctctcttt 


tgtt ttacga 


t tgc taaatt 


t tatac tcac 


aac tgccaat 


tctccacc tc 


4200 






tgactat t ta 


ttgacagtcc 


ccact taatt 


t tctgttcac 


cga t taccgt 


cttttt tgta 


4260 




15 


aataatcttt 


ct tggaacca 


accaat taat 


acgttataat 


cgc taac 1 1 1 


gaagat t tgc 


4320 






tacaatggct 


agcaagggcg 


aggagctgtt 


caccggggtg 


gtgcccatcc 


tggtcgagct 


43S0 




20 


ggacggcgac 


gtaaacggcc 


acaagt tcag 


cgtgtccggc 


gagggcgagg 


gcga tgGcac 


4440 






ctacggcaag 


ctgaccctga 


age tcatctg 


caccaccggc 


aagc tgcccg 


tgccc tggcc 


4500 






caccctcgtg 


accaccc tga 


cc tacggcgt 


gcagtgct tc 


agccgc tacc 


ccgaccacat 


4560 




25 


gaagcagcac 


gacttct tea 


agtccgccat 


gcccgaaggc 


t acg t ccagg 


ascccacca t 


4620 






c t tct tcaag 


gacgacggca 


act acaagac 


ccgcgccgag 


s teaaff 1 1 cs 




4680 




30 


cctggtgaac 


cgcatcgagc 


tgaagggcat 


cgact tcaag 


saecracff sea 


ac a tec tcxfif 


4740 




gcacaagctg 


gagtacaact 


acaacagcca 


caacgtc tat 


a tea t ffffccff 




4S00 






gaacggcatc 


aaggt gaac t 


tcaagatccg 


ccacaacatc 


i?azsac s7E^ca 




4S60 




35 


cgccgaccac 


t accagcaga 


acacccccat 


cggczacegc 


cccg t gc t gc 


t ffcee ?aca a 


4920 






ccac tacctg 


agcacccagc 


ccgccctgag 


caaagacccc 


aac Easaafifc 


ffc^atcacat 


49S0 

1 ^ o w 






ggtcctgctg 


gagt tcgtga 


ccgccgccgg 


gatcactc tc 




a ff c t ff t ac a a 


5040 




40 


gtaagct taa 


ataggaaagt 


t tcctcaaca 


ggat tacagt 


g tagc tacc t 


acat cc t ff aa 


5100 






aaatatagcc 


tt taaatcat 


t Cttatat ta 


taactc tgta 


taatagaga t 


aag tccat 1 1 


5160 




45 


1 1 taaaaatg 


1 1 1 tccccaa 


accataaaac 


cctacacaag 


t tgt tctagt 


aacaa t acat 


5220 




{faff 223173 t O 






a I gacg I cac 


aagac aaaaa 


aaaaaaaaaa 


COCA 






aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaag 


tacc t tctga 


5340 




SO 


ggcggaaaga 


accagccgga 


tccagacatg 


a taagataca 


t tgatgagt t 


tggacaaacc 


5400 






acaac tagaa 


tgcagtgaaa 


aaaacgc t c t 


at t tgtgaaa 


t ttgtgatgc 


tattgcct ta 


5460 






c C tgtaacca 


ttacaagcig 


caa taaacaa 


gt taacaaca 


acaat tgca t 


tcac 1 1 ta eg 


5520 





55 



27 



BNSOOCID: <EP. 



.1223219A2_L> 



EP1 223 219 A2 





c c I c ag^ lie 


agggggaggc 


gtgggaggcc 


ccccaaagca 


agcaaaaccc 


c t acaaa t gt 


55S0 


5 


gg c a cggc t g 


at tacgaccc 


ggccgcctcg 


cgcgc ctcgg 


t ea ceacegc 


gaaaacc tc t 


5640 




gacacacgca 


gctcccggag acggtcacag cctgtctgta 




? 0 0 a f? c 2 3 c 


5700 




aagcccgtca 


gggcgcg ica 


gCggg I gt ig 


araatftatro 








10 


cacgtagcga 


tagcggagtg 


c a c ac I ggc c 


f ^ ^ ^ t ^ t ffftr 

L aac i a V gc g 


gca t c agagc 


aga t L g I ac i 






gagagtgcac 


catatgcggt 


gtgaaa tacc 


gcacaga t gc 


gt aaggagaa 


aa t accgca t 


CCCA 


15 


caggcgctct 


tccgcttcct 


cgc tcaccga 


c ccgccgcgc 


tcggtcgt tc 


ggc tgcggcg 




agcggtatca 


gctcactcaa 


aggcggtaat 


acggt ta tec 


acagaatcag 


ggga taacgc 


DUUU 




aggaaagaat 


gagcaaaagg 


ccagcaaaag 


gccaggaacc 


gtaaaaaggc 


cgcgt tgctg 


oOoO 


20 


gcgtttttcc 


ataggctccg 


cccccctgac 


gagcatcaca 


aaaatcgacg 


c tcaagtcag 


6120 




aggtggcgaa 


acccgacagg 


actataaaga 


taccaggcgt 


t tccccctgg 


aagctccctc 


6180 




gtgcgctctc 


ctgttccgac 


cctgccgct t 


accggatacc 


tgtccgcctt 


tctccct teg 


6240 


25 


ggaagcgtgg 


cgct t tctca 


tagctcacgc 


tgtaggtatc 


tcagt tcggt 


gtaggtcgtt 


6300 




cgctccaagc 


tgggctgtgt 


gcacgaaccc 


cccgt tcagc 


ccgaccgc tg 


cgcct ta tec 


6360 


30 


ggtaactatc 


gtct tgagtc 


caacccggta 


agacacgac t 


tatcgccac t 


ggcagcagcc 


6420 


actggtaaca 


ggat tagcag 


agcgaggta t 


gtaggcggtg 


ctacagagt t 


cttgaagtgg 


64S0 




tggcctaact 


acggctacac 


tagaaggaca 


gtatt tggta 


tctgcgc tct 


gctgaagcca 


6540 


35 


gt tacct teg 


gaaaaagagt 


tggtagctc t 


tgatccggca 


aacaaaccac 


cgctggtagc 


6600 




ggtggttttt 


ttgtttgcaa 


gcagcagat t 


acgcgcagaa 


aaaaaggatc 


tcaagaagat 


6660 




cctttgatct 


tt tctacggg 


gtctgacgct 


cagtggaacg 


aaaac tcacg 


t taagggat t 


6720 


40 


t tggtcatga 


gattatcaaa 


aaggatc t tc 


acc tagatcc 


ttt taaatta 


aaaatgaagt 


6780 




1 1 taaatcaa 


tctaaagtat 


atatgagtaa 


act tggtctg 


acagt tacca 


atgcttaatc 


6o40 


45 


agtgaggcac 


ctatc tcagc 


gatctgccta 


tc tcgtccat 


ccatagc tgc 


ctgac tcccc 


CO A A 
09U0 


gtcgtgtaga 


taac tacgat 


acgggagggc 


t taccacc tg 


gccccagtgc 


tgcaa tgata 


6960 




ccgcgagacc 


cacgctcacc 


ggctccagac 


c tatcagcaa 


t aaaccagcc 


agccggaagg 


7020 


SO 


gccgagcgca 


gaagtggtcc 


t gcaac 1 1 ta 


tccgcct cca 


tccag tc tat 


taattgt Cgc 


70S0 




cgggaagcta 


gagtaagtag 


t tcgccagt t 


aat agt c tgc 


gcaacgt tgt 


tgccat tgcc 


7H0 




gcaggcaccg 


tggcgtcacg 


c tcgccgt 1 1 


ggtatggctc 


cat tcagc tc 


cggt tcccaa 


7200 



55 
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cgaccaaggc gagtracatg accccccatg ccgcgcaaaa aagcggttag c icc t tcggc^^ 7260 

cctccgatcg ctgtcagaag taagciggcc gcagcgccac cactcatggc catggcagca 7320 

ccgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac cggtgagtac 7380 

tcaaccaagt cattctgaga atagCgtatg cggcgaccga gctgctctcg cccggcgtca 7440 

acacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcac cggaaaacgc 7500 

tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 7560 

actcgtgcac ccaactgatc ttcagcatcc tttactttca ccagcgtttc tgggtgagca 7620 

aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaaca 7680 

ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc 7740 

ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 7S00 

cgaaaagtgc cacctgacgt ctaagaaacc attattatca tgacattaac ctataaaaat 7860 

aggcgtatca cgaggccctt tcgtcttcaa gaattggtcg accaattctc atgtttgaca 7920 

gctcatcatc gataagct 793S 



<210> 14 
<21i> 29 
30 <212> DNA 



<213> Artificial Sequence 
<400> J4 

gggactagtt agtccgcgaa atcgagatg 29 

<210> 15 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<400> 15 

gaggaattct gataccattg ccattgtagc aaatcttcaa ag - 42 



<210> 16 
<211> 28 
45 <212> DNA 



<213> Artificial Sequence 
<400> 16 

gggac atgtt tttga aatat atttc age 28 

<2I0> 17 

<211> 27 

<:212> ONA 

<213> Artificial Sequence 

<400> 17 
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gggaa^gctca gcaat cccag atagt ct 
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10 



<210> 18 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<400> 18 

ttgac tagtt attaa tagta 



20 



15 



20 



25 



30 



<210> 19 
<211> 29 
<:212> DNA 

<213> Artificial Sequence 
<400> 19 

cccga attca taacg tttta catat aagt 



<210> 20 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<400> 20 

aaaga attcc cagta ccccc aacct cttca 



<210> 21 

<211> 20 

<212> DMA 

<213> Artificial Sequence 

35 <400> 21 

aaaat gattt aaagg ctata 
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Claims 

1. A peptide having the amino acid sequence of Met Phe Leu Lys Tyr lie Leu Ala Ser Gly lie Cys Leu Val Ser Leu 

Leu Ser Ser Thr Asn Ala. 



45 



so 



55 



2. A DNA encoding the peptide of claim 1 . 

3. A recombinant vector containing the sequence of the DNA according to claim 2. 

4. A multicloning vector containing the sequence of the DNA according to claim 2 and a multicloning site. 

5. AnexpressionvectorcontalningthesequenceoftheDNAaccordingtoclaim2andaheterologousprotein 
gene. 

6. A Schizosaccharomyces pombe transfonnant carrying the expression vector according to claim 5. 

7. A process for producing a protein which comprises incubating the transformant according to claim 6 and recovering 
an expressed heterologous protein. 
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I G . 1 



S. pom6« invj 91 , TTAPMCTCLCHTNEYI^PSCTTNATDnPKlIirTPSSCrMMOPHGLVrr TC 

5- accidentalis 11 : PLTtTrrcXVASSSIDLSVDTaBYHRPLIHrTPEKGHMNDPHGTPYDKTA 

S* csrsvlsiae SUC2 3 i LQArTrTLACrAAKHSASMTNETSDRPLVHrXPNKGHMNDPHGLHIOEKD 

S. pomba i/lVj 101 : CVTHHFrQYSPKTLTACE- VnHGHTVSKbX,IHHENX PlAIYPOEHENGVX 

S- occidentdiij 61 : KTHKMrQYNPNATAHGQPLYWCHAtSNOLVHHOEHEMAIGPBHDNEGIF 

5- cerdviaiad 53 ': ARHHTTrQTNPNDTVHCTPLFHGHATSDDLTNHEOQPlAIAPKRNDSC 



S- ponf6« invJ 151 
^. occidentaiia 111 
5- cejrevi9io« 5C/« 103 



SLPrSGdAWDVHH — — SSG&FSHDTZ-PECRZVZ«XXTDHHTGVAERQ 

SGSIWDHNN TSGFF-NSSIOPMQnZVAZYTNtlHPDL-QTQ 

— ArSGSHWDTNK TSGrF-MbTZOPRQnCVAZHT*ZNTPC6eEQ 



S. pamba invi. 201 
5. occid^ntalis 161 
5. cejroi^isi«« ^ac^ 153 



AZAYTT06GXTrKKYSGNPVX«DZNS£»QrR0Pl^VMW0FDAHnHVXZVAMSQ 
piAPSLOCGXTFTKICENNPVZDVSSNOrRDPRVFHHSRrKSHDKGCSEIA 
YZSXSTDGGXTFTEXQRHPVT«AANSTQFR0PKVFt^XeP6Ql(HZHTAAR3Q 



5. pombe Invj 251 'i 

5. oecide/italia 211 i 
5. eereviaiae 203 i 



N¥GIArYSSXDI«IHWTE£.SVr9TSGYI*CLQY£CPGHARVPVEGT05YK-W 
RVXZairC9ANLKmfVXNSNrS-SGYYGKQYGH9iU.IEVPICMSDKSK-H 
0YKiEXTSS0OLKSHKTE5AFANecrZrCYQYECPCZ.ZEVPTeQOPSKgYH 



S, pombe 'Invi 301' 
5. pccidentaiis 261 
S.' c0f9W«iao 5(/C2 2 53 



I VLFZSINPGAP-LCGSWQTrVGDHNCTNrVPDDGQTRrVDLCKpFYA-S 
I VMFLAINPGSP-LGCSIHQTFVGDFDCFQFVPDDSQTRrVDIGKDFYA-F 
I VHrXSIHPGAP-AGCarNaTrVGSFNaTHrEAF0NQ8RWOFGKDYYAX.a 



S. pomha invJ 351 1 ALTHSSSANADVICVCHASMHQYTNQAPTQVF-RSAhTVARKFTLRDVPQ 

S. occidentaiia 311. : QTr — aCVEHGVLCLAHASHHQYADaVPTNPH-RSSTSLARMYTLRYVMQ 

S. ceraviffiae 5VC2. 303 i TFrNTDPTYGSALGI AHASHHEY9AFVPTNPH-nSBrtSL.VRKFSLHTEYQ 
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, disruption 
host mutant strain 
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(54) Inducible promoter and secretion signal for use in schizosaccharomyces pombe, expression 
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(57) The present invention relates to an inducible 
promoter gene for use in the fission yeast Schizosac- 
charomyces pombe , an inducible expression vector 
containing the inducible promoter gene, Schizosaccha- 
romyces pombe transformed with the inducible expres- 
sion vector and a process for producing a heterologous 
protein by using the Schizosaccharomyces pombe 
transform ant. 

The present invention also relates to a secretion 
signal gene for use in Schizosaccharomyces pombe , an 
expression vector containing the secretion signal, 
Schizosaccharomyces pombe transformed with the ex- 
pression vector and a process for producing a heterol- 
ogous protein by using the Schizosaccharomyces 
pombe transformant. These inducible promoter gene 
and secretion signal gene are originated from the S. 
pombe gene invoh^ed in expression of the Invertase pre- 
cursor 

The inducible promoter gene is originated from the 
region responsible for the catabolite repression of the 



invertase precursor. When a Schizosaccharomyces 
pombe transfomnant carrying the inducible expression 
vector is incubated, the timing of the production of the 
desired heterologous protein can be controlled by reg- 
ulating the expression of the heterologous protein struc- 
tural gene depending on the presence or absence of a 
specific nutrient (especially glucose). 

The secretion signal gene encodes the secretion 
signal for the invertase precursor When a Schizosac- 
charomyces pombe transformant carrying an expres- 
sion vector containing the secretion signal gene is incu- 
bated, a heterologous protein fused with the secretion 
signal is synthesized as a precursory protein in the cell, 
and after intracellular separation of the secretion signal, 
the heterologous protein is secreted from the cell. 
Therefore, it Is possible to obtain the desired protein 
from the cell culture, and the desired heterologous pro- 
tein can be purified easily. 
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